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^ (54) Title: CRYSTALS OF CYTOCHROME P450 2C9, STRUCTURES THEREOF AND THEIR USE 
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0(57) Abstract: The present invention provides cytochrome 2C9 proteins which have been modified to introduce a proline residue at 
positions 220 or 222 of the wild type sequence which can be crystallised to provide high resolution structures. The structures may be 
^ used for homology modelling of other cytochrome P450 structures such as 2C8, 2C18 and 2C19, and for analysis of the interaction 
^ of hgands with P450. 
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CRYSTALS OF CYTOCHROME P 4 50 2C9. STRf irrURES THEREOF AND THEIR USE 
Field of the Invention. 

The present invention relates to the human cytochrome P450 protein 2C9, methods for its 
crystallization, its X-ray crystal structure and the use thereof. 

Background to the Invention. 

Cytochrome P450s (CYP450) form a very large and complex gene superfamily of hemeproteins 
that metabolise physiologically important compounds in many species of microorganisms, 
plants and animals. Cytochrome P450s are important in the oxidative, peroxidative and 
reductive metabolism of numerous and diverse endogenous compounds such as steroids, bile, 
fatty acids, prostaglandins, leukotrienes, retinoids and lipids. Many of these enzymes also 
metabolise a wide range of xenobiotics including drugs, environmental compounds and 
pollutants. Their involvement in drug metabolism is extensive, it is estimated that 50% of all 
known drugs are affected in some way by the action of CYP450 enzymes. Significant resource 
is employed by the pharmaceutical industry to optimise drug candidates in order to avoid their 
detrimental interactions with the CYP450 enzymes. Ahother level of complication results from 
the fact that these enzymes exhibit different tissue distributions and polymorphisms between 
individuals and ethnic populations 



Most mammalian P450s are located in the liver^ but other organs and tissues have high 
concentrations of certain cytochrome P450s, including the intestinal wall, lung, kidney, adrenal 
cortex and nasal epithelium. Mammals have about 50 unique CYP450 genes and each family 
member is 45-55 KDa in size and contains a heme moiety that catalyses a two-electron 
activation of oxygen. The source of electrons may be used to classify CYP450s. Those that 
receive electrons in a three protein chain in which electrons flow from a flavin adenine 
dinucleotide (FAD) containing reductase, to an iron-sulphur protein, and then to P450 belong to 
the group of class I P450s, and include most of the bacterial enzymes. Class II P450s receive 
electrons from a reductase containing both FAD and flavin mononucleotide (FMN), and 
comprise the microsomal P450s that are the main culprits of drug metabolism. The mammalian 
microsomal cytochrome P450s are integral membrane proteins anchored by an N-terminal 
transmembrane spanning a-helix. They are inserted in the membrane of the endoplasmic 
reticulum by a short, highly hydrophobic N-terminal segment that acts as a non-cleavable signal 
sequence for insertion into the membrane. The remainder of the mammalian cytochrome P450 
protein is a globular structure that protrudes into the cytoplasmic space. Hence, the bulk of the 
enzyme faces the cytoplasmic surface of the lipid bilayer. P450s require other membranous 
enzymatic components for activity including the flavoprotein NADPH-cytochrome P450 
oxidoreductase and, in some cases, cytochrome b5. A single cytochrome P450 oxidoreductase 
supports the activity of all the mammalian microsomal enzymes by interacting directly with the 
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P450s and transferring the required two electrons from NADPH. Cytochrome P450s are able to 
incorporate one of the two oxygen atoms of an 0 2 molecule into a broad variety of substrates 
with concomitant reduction of the other oxygen atom by two electrons to H 2 0. Cytochrome 
P450 are known to catalyse hydroxylations, epoxidation, N-, S-, and O-dealkylations, N- 
5 oxidations, sulfoxidations, dehalogenations, and other reactions. 

The genes of the P450 superfamily have been categorized by Nelson et al (Pharmacogenetics, 6; 
1 -42, 1 996) who proposed a systematic nomenclature for the family members. This 
nomenclature is used widely in the art, and is adopted herein. Nelson et al provide cross- 
references to sequence database entries for P450 sequences. 

Homo sapiens has 1 7 cytochrome P450 gene families and 42 subfamilies that total more than 50 
sequenced isoforms. Cytochrome P450s from families 1, 2 and 3 constitute the major pathways 
for drug metabolism. Many drugs rely on hepatic metabolism by cytochrome P450s for 
clearance from the circulation and for pharmacological inactivation. Conversely, some drugs 
have to be converted in the body to their pharmacologically active metabolites by P450s. Many 
promising lead compounds are terminated in the development phase due to their interaction with 
one or more P450s. One of the greatest problems in drug discovery is the prediction of the role 
of cytochrome P450s on the metabolism or modification of drug leads. Early detection of 
metabolic problems associated with a chemical lead series is of paramount importance for the 
pharmaceutical industry. Obtaining crystal structures of the main human drug metabolising 
cytochrome P450s would be highly valuable for drug design, as this would provide detailed 
information on how P450 enzymes recognize drug molecules and the mode of drug binding. 
This in turn would allow drug companies to develop strategies to modify metabolic clearance 
and decrease the attrition rates of compounds in development. 

The major human CYP450 isoforms involved in drug metabolism are CYP1 A2, CYP2C9, 
CYP2C19, CYP2D6 and CYP3A4. The level of sequence identity between these family . 
members ranges from about 20-80%, with much of the variability within the residues involved 
in substrate recognition. CYP450 enzymes are also present in bacteria and much of the 
understanding of substrate recognition is derived from crystal structures obtained of bacterial 
CYP450 enzymes. 

It is well-known in the art of protein chemistry, that crystallising a protein is a chancy and 
difficult process without any clear expectation of success. It is now evident that protein 
crystallization is the main hurdle in protein structure determination. For this reason, protein 
crystallization has become a research subject in and of itself, and is not simply an extension of 
the protein crystallographer's laboratory. There are many references which describe the 
difficulties associated with growing protein crystals. For example, Kierzek, A.M. and 
Zielenkiewicz, P., (2001), Biophysical Chemistry, 91, 1-20, Models of protein crystal growth, 
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and Wiencek, J.M. (1999) Annu. Rev. Biomed. Eng., 1, 505-534, New Strategies for crystal 
growth. 

It is commonly held that crystallization of protein molecules from solution is the major obstacle 
in the process of determining protein structures. The reasons for this are many; proteins are 
complex molecules, and the delicate balance involving specific and non-specific interactions 
with other protein molecules and small molecules in solution, is difficult to predict. 

Each protein crystallizes under a unique set of conditions, which cannot be predicted in advance. 
Simply supersaturating the protein to bring it out of solution may not work, the result would, in 
most cases, be an amorphous precipitate. Many precipitating agents are used, common ones are 
different salts, and polyethylene glycols, but others are known. In addition, additives such as 
metals and detergents can be added to modulate the behaviour of the protein in solution. Many 
kits are available (e.g. from Hampton Research), which attempt to cover as many parameters in 
crystallization space as possible, but in many cases these are just a starting point to optimise 
crystalline precipitates and crystals which are unsuitable for diffraction analysis. Successful 
crystallization is aided by a knowledge of the proteins behaviour in terms of solubility, 
dependence on metaliohsfor correct folding or activity, interactions with other molecules and 
any other information that is available. Even so, crystallization of proteins is often regarded as a 
time-consuming process, whereby subsequent experiments build on observations of past trials. 

In cases whereprotein crystals are obtained, these are not necessarily always suitable for 
diffraction analysis; they may be limited in resolution, and it may subsequently be difficult to 
improve them to the point at which they will diffract to the resolution required for analysis. 
Limited resolution in a crystal can be due to several things. It may be due to intrinsic mobility 
of the protein within the crystal, which can be difficult to overcome, even with other crystal 
forms. It may be due to high solvent content within the crystal, which consequently results in 
weak scattering. Alternatively, it could be due to defects within the crystal lattice which mean 
that the diffracted x-rays will not be completely in phase from unit to unit within the lattice. 
Any one of these or a combination of these could mean that the crystals are not suitable for 
structure determination. 

Some proteins never crystallize, and after a reasonable attempt it is necessary to examine the 
protein itself and consider whether it is possible to make individual domains, different N or C- 
terminal truncations, or point mutations. It is often hard to predict how a protein could be re- 
engineered in such a manner as to improve crystallisability. Our understanding of crystallisation 
mechanisms are still incomplete and the factors of protein structure which are involved in 
crystallisation are poorly understood. 
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As of 2000, eight cytochrome P450 structures have been solved by X-ray crystallography and 
are available in the public domain. All of the cytochrome P450s, whose structures have been 
solved, were expressed in R coli. Six structures correspond to bacterial cytochrome P450s: 
P450cam(CYP101 Poulos era/., 1985, J. Biol. Chem.,260, 16122), the hemeprotein domain of 
5 P450BM3 (CYP102, Ravichandran et al, 1993, Science, 261, 731), P450terp (CYP108, 
Hasemannera/., 1994,./ AM Biol. 236, 1169), P450eryF (CYP107A1, Cupp-Vickery and 
Poulos, 1995, Nature Struct. Biol. 2, 144), P450 14o-sterol demethylase (CYP51, Podust et al., 
2001, Proc. Natl. Acad. Sci. USA, 98, 3068) and the crystal structure of a thermophilic 
cytochrome P450 (CYP1 19) from Archaeon sulfolobus solfataricus was solved (Yano era/., 
10 2000, J. Biol. Chem. 275, 3 1 086). The structure of cytochrome P450nor was obtained from the 
denitrifying fungus Fusarium oxysporum (Shimizu et al. 2000, J. Inorg. Biochem. 81,191). The 
eighth structure is that of the rabbit 2C5 isoform, the first and only structure of a mammalian 
cytochrome P450 (Williams et al. 2000, Mol. Cell. 5,121). 

1 5 The reason why the mammalian cytochrome P450s have been particularly difficult to crystallize, 
compared to their bacterial counterparts, resides in the nature of these proteins. The bacterial 
cytochrome P450s are soluble whereas the mammalian P450s are membrane-associated 
proteins. Thus, structural.studies on mammalian cytochrome P450s may use the combination of 
heterologous expression systems that allow expression of single cytochrome P450s at high 

20 concentration with modification of their sequences to improve the solubility and the behaviour 
of these proteins in solution. 

Due to significant sequence differences from both the bacterial proteins and rabbit proteins, to 
fiilly understand the role of the human C YP450 enzymes in drug metabolism, the crystal 
25 structures of human isoforms are still required. 

Ibeanu et al„ (1996), J Biol Chem, Vol. 271, 12496-12501 describe the production of modified 
2C9 proteins in yeast in which certain residues, including Ser 220 and Pro 221, were altered. 
These altered proteins were found to exhibit 2C1 9-like activity for omeprazole. The proteins 
30 retained wild-type N-terminal sequence. 

Disclosure of the Invention. 

A nomenclature has been adopted to describe the secondary structure observed in cytochromes 
P450. The authors of the first structure of a P450, P450cam denoted the 12 helical segments A 
35 through L from the N-terminal to C-terminal direction and this naming has been continued as 
more structures have been determined. In addition, some P450 structures have shown more 
helices, for example the description of P450 BM3 details 15 helices (A, B , B\ C, D, E, F, G, H, 
I, J, J\ K, K', L), where the additional helices are indicated by the 1 symbol. 



WO 03/035693 PCT/GB02/04872 

5 

Each helix is typically. 6 amino acids (Helix H in 2C5) to 32 amino acids (Helix I in BM3) in 
length. The helices are linked by (3-strands, short linkers and long flexible loops of up to 30 
amino acids in length. 

Among these flexible structures, one of the most pronounced is the loop between the F and G 
helices ("the F-G loop"). This loop is probably involved in the substrate access channel, and 
could move to accommodate the substrate in the active site. This loop has also been described 
as participating, with the N-terminus domain, to the anchorage of the cytochrome P450s to the 
membrane. 

In the 2C5 structure (PDB ID 1DT6) helix F ends at residue 206 and helix G starts at residue 
228, therefore the loop between is residues 207 to 227 (definitions from the secondary structure 
assignment program DSSP (Kabsch and Sander, Biopolymers 22 (1983) 2577-2637)). Of these 
21 residues, 12 cannot be seen in the 2C5 structure. This is an indication of its flexible nature. 
We predict it will be similar in the rest of the 2C family and in other human P450s. The region 
is thought to be involved in membrane association of the enzyme in vivo, and orientation of the 
substrate access channels. 

Using the models we have developed; we predicted the F-G loop in 2C9 was from Leu208 to . 
Pro227. This region contains 20 residues^ 11 of which can be classified as hydrophobic, further 
supporting the hypothesis that this region may be embedded in membranes or involved in 
aggregation as has been suggested. From the 2C9 structure of the invention, (definitions from 
DSSP), the loop between helixes F and G actually starts at 209 and ends at 227. 

The invention provides modified human 2C9 P450 proteins as described herein, and nucleic acid 
encoding such proteins, as well as the use of the nucleic acids in making the proteins. 

The invention further provides methods for the production and purification of the 2C9 P450 
proteins of the invention. 

The invention also provides crystals of the modified 2C9 P450 proteins of the invention. . 

The present invention further relates to the crystal structure of human C YP450 2C9, which 
allows the binding location of the substrates in the enzyme to be investigated and determined. 

In general aspects, the present invention is concerned with the provision of a P450 structure and 
its use in modelling the interaction of molecular structures, e.g. potential pharmaceutical 
compounds, with this structure. 
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The above aspects of the invention, both singly and in combination, all contribute to features of 
the invention which are advantageous. 

Description of the Drawings 
5 Figure 1 sets out Table 1 , providing the coordinates of a 2C9 structure, 

Figure 2 sets out Table 2, providing the coordinates of a 2C9-FGloop K206E structure, 

Figure 3 sets out Table 3, providing the coordinates of a 2C9-FGloop structure, and 

10 

Figure 4 sets out Table 7, providing modelled coordinates of residues 215, 216, 220, 221, 222, 
and 223 of a 2C9 wild type protein. 

Figure 5 sets out Table 8, providing a refined structure of 2C9-FGloop K206E. 

15 

Figure 6 sets out Table 1 1, showing conditions in which crystals of proteins of the invention 
were obtained. 

. Figure 7 sets out Table 1 8, showing a homology model of 2C1 9. 

Figure 8 sets out Table 1 9, showing 2C 1 8 replacement coordinates. 

Figure 9 sets out Table 20, showing 2C8 replacement coordinates. 

25 Figure 10 shows the sequence alignment of the N-terminal truncated 2C9 variants and 2C9trunc 
with the published 2C9 wild type sequence (Meehan et al. 1988, ^m. J. Hum. Genet. 42, 26). 

Figure 1 1 shows data from 4-diclofenac hydroxylase assays. 

30 Description of Tables. 

Table 1 (see Fig. 1) provides the coordinates of the 2C9 structure obtained in Example 9. 
Table 2 (see Fig. 2) provides the coordinates of the 2C9-FGloop K206E structure obtained in 
Example 1 1 . 

Table 3 (see Fig. 3) provides the coordinates of 2C9-FGloop obtained in Example 12. 
35 Table 4 (Example 13) lists residues that line the binding site of 2C9, 

Table 5 (Example 13) lists residues previously inferred to be in the binding site. 
Table 6 (Example 13) lists newly identified binding pocket residues. 

Table 7 (see Fig. 4) provides modelled coordinates of residues 215, 216, 220, 221, 222, and 223 
of a 2C9 wild type protein. 
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Table 8 (Example 1 6 and Figure 5) is a refined 2C9-FGloop K206E structure. 

Table 9 (Example 17) describes further 2C9 proteins of the invention and the primers and 

methods used to obtain them. 

Table 10 (Example 17) describes control 2C9 proteins and the primers and methods used to 
5 obtain them. 

Table 1 1 (Examples 20 and 24, and Figure 6) shows crystallisation conditions for proteins of the 
invention. 

Table 12 (Example 18) sets out mass spectrometry data for 2C9 proteins. 
Table 13 (Example 19) shows activity data for 2C9 proteins of the invention. 
10 Table 14 (Example 21) shows 2C9-2C19 chimeras of the invention and the primers and/or 
methods used to obtain them. 

Table 15 (Example 22) sets out mass spectrometry data for 2C9-2C19 chimeric proteins. 
Table 16 (Example 23) shows activity of 2C9-FGloop K206E (1 155). 
Table 17 (Example 24) shows activity of 2C9-2C19 chimeras of the invention. 
1 5 Table 1 8 (Example 25 and Figure 7) sets out a homology model of 2C 1 9. 
Table 19 (Example 26 and Figure 8) shows 2C 18 replacement coordinates. 
Table 20 (Example 27 and Figure 9) shows 2C8 replacement coordinates. 

Description of sequences. 

20 SEQ ID NO: 1 is DNA sequence of 2C9trunc. 

SEQ ID NO:2 is the amino acid sequence of 2C9trunc. 

SEQ ID NO:3 is the DNA sequence of 2C9-P220 (also referred to as 1072). 

SEQ ID NO:4 is the amino acid sequence of 2C9-P220. 

SEQ ID NO:5 is the DNA sequence of 2C9-FGloop (also referred to as 101 5). 
25 SEQ ID NO:6 is the amino acid sequence of 2C9-FGIoop. 

SEQ ID NO:7 is the DNA sequence of 2C9-FGloop K206E (also referred to as 1 155). 

SEQ ED NO:8 is the amino acid sequence of 2C9-FGloop K206E. 

SEQ ID NOs:(2x+7) and (2x+8) where x is an integer from 1 to 49 are the DNA and amino acid 
sequences, respectively, of the 2C9 proteins referred to as clones 1078, 1081, 1082, 1085, 1097, 

30 1100, 1101, 1102, 1115, 1116, 1117, 1118, 1121, 1122, 1123, 1165, 1220, 1319, 1339, 1340, 
1361, 1362, 1363, 1364, 1366, 1367, 1368, 1369, 1370, 1371, 1372, 1391, 1392, 1394, 1396, 
1397, 1424, 1443, 1444, 1475, 1477, 1491, 1595, 1600, 1610, 1632, 1661, 1662 and 1664 
respectively. Thus the DNA sequence encoding clone 1078 is SEQ ID NO:9 and its 
corresponding amino acid sequence is SEQ ID NO: 10, and for clone 1664 the DNA is SEQ ID 

35 NO: 105 and the corresponding amino acid sequence is SEQ ID NO: 106. 
SEQ ID NO: 107 is the DNA sequence of clone 1039 (control clone). 
SEQ ID NO: 108 is the amino acid sequence of clone 1039. 
SEQ ID NO: 109 is the DNA sequence of clone 1365 (control clone). 
SEQ ID NO:l 10 is the amino acid sequence of clone 1 365. 

40 SEQ ID NO.i 1 1 is the DNA sequence of clone 1423 (control clone). 
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SEQ ID NO:l 12 is the amino acid sequence of clone 1423. 
Further sequences are identified in the accompanying text. 

Detailed Description of the Invention. 

5 A. 2C9 Proteins and their Production. 

The sequence of 2C9 is available in the art, for example from a number of database sources 
cited in Nelson et al, 1996, ibid. This includes the SwissProt database, in which 2C9 is entry 
number PI 1712. 

10 The 2C9 P450 protein is desirably truncated in its N-terminal region to delete the hydrophobic 
trans-membrane domain, and the region replaced by a short (e.g. 8 to 12 amino acid sequence 
containing one or more (e.g. 3, 4 or 5) positively charged amino acids. For expression of the 
human 2C9 P450, we have used an N-terminal sequence MAKKTSSKGR (SEQ ID NO:l 14) in 
place of the N-terminal 29 amino acid residues, which increases expression of the proteins in E. 

1 5 coli and increases solubility. 

The 2C9 P450 may optionally comprise a tag, such as a C-terminal polyhistidine tag to allow for 
recovery and purification of the protein. 

20 We have found that the position of the proline residue in the F-G loop appears to play a 
significant role in the formation of a P450 crystal. In particular, the presence of a proline at 
position 220 or 222 in 2C9 appears to be important for crystallisation to occur. 

In 2C9 wild type there is a proline residue at position 221. Moving it to position 220, by . 
25 substituting position 220 by proline and removing the Pro221 (by substitution by any other 
residue, but preferably alanine or threonine) in 2C9 promotes crystallisation. Alternatively the 
proline may be moved to position 222, with position 221 likewise being substituted. 

In 2C9 we have made the changes to positions 220 and 221 with and without other changes. 
30 Where other changes were made, these were 12 1 5 V, C2 1 6 Y, I222L and I223L, although it is not 
essential that any or all of these be made to provide for crystallisation. 

Our experiments have been based on the use of a particular N-terminal truncation of 2C9, as set 
out in SEQ ID NO:2 and shown in Figure 1 0. This protein also comprises a polyhistidine tag at 
35 the C-terminus. The N-terminal truncation and tag are both features which can be varied by 
those of skill in the art using routine skill. For example, alternative N-terminal sequence might 
be utilised, for example for production in host cells other than E. coli. Likewise, other tags may 
be used for purification of the protein as described below. These N- and C-terminal 
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modification may be made to a 2C9 protein which retains the core sequence of residues 3 1-490 
of the wild type sequence illustrated in Figure 10. 

The present invention provides a P450 2C9 protein which comprises the following changes: 
5 position 220 or position 222 is proline; and 

optionally up to 30, for example up to 25, for example up to 10, for example up to 5 
other positions are altered, 

the positions 220 and 222 being numbered according to wild type 2C9. This numbering 
is shown in Figure 10, 

10 

Preferably the change is to position 220. 

It will be appreciated from the discussion above that by 2C9 protein, it is meant a protein 
comprising residues 31 to 490 of the wild type sequence, optionally with N- and/or C-terminal 
1 5 sequences provided to facilitate expression and recovery of the protein. 

Where present, the N-terminal sequence is preferably not the wild-type sequence. Preferably, it 
is shorter that the wild type sequence (which is 30 amino acids). Preferably, the N-terminal 
region joined to residue 31 is the truncation illustrated in the accompanying examples, i.e. SEQ 
20 ID NO:l 14 plus a proline residue between it and residue 31 (also proline). This type of N- 
terminal sequence reduces the tendency of 2C9 to anchor to membranes and to aggregate 
compared to the wild type sequence. V 

Where present, the C-terminal sequence is preferably no larger than 30, and preferably no larger 
25 than 1 0 amino acids in size. 

In a preferred aspect, one of the up to 30 changes is to the position 22 1 , such that it is not 
proline. However this is not essential as it is shown herein (clone 1078) that crystals can be 
obtained with proline at position 221 as long as one of the changes made above is also included. 

30 

A particular advantage of the proteins of the invention is that they are crystallisable. That is, we 
have found that we have been able to form crystals which diffract X-rays, and thus we have 
been able to analyse these crystals to provide structural coordinate data at a resolution of 3.1 A or 
better. 

35 

A further advantageous feature of the invention is that we have been able to obtain crystals of a 
P450 protein in the absence of a ligand. Such crystals are useful for screening ligands with a 
view towards determining co-complex structures. Determining the molecular structure of 2C9 
can also be used in computer-based in silico ligand screening, 

40 
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We have also shown additional changes to the 2C9 wild type sequence in addition to the 
changes at any of 220-222 may be introduced. A number of specific changes are illustrated in 
the clones of 2C9 set out in the accompanying examples. These include: 

5 - changes to the FG loop region. A number of clones have such changes, including the 
clone 2C9-FGloop (3 changes), clone 1363 (3 changes), clones 1361, 1362, 1364 
1369 (2 changes), and clones 1366, 1371 (1 change). 

changes to the surface region of 2C9. Such changes are illustrated herein by clone 1 123 
10 (3 changes), clones 1 102, 1340, 1397, 1443 (2 changes), and clones 1081, 1082, 1085, 1097, 
1100, 1101, 1115, 1116, 1117, 1118, 1121, 1122, 1165, 1339, 1391, 1392, 1394, 1396(1 
change). These surface changes may be in addition to the FG loop changes. 

up to 20 changes in total on top of changes to positions 220 and 221; e.g. clone 1595 (20 
15 changes), 1600 (13), and 1632 (11). 

Thus clone 1 595 has 22 changes form wild type in total- 6 in FG loop (including 220, 221), 3 in 
active site, 12 on the surface. Of these 9 are conserved changes and 13 are non-conserved. 

20 Our data illustrate that a variety of other positions in addition to the specific 220 or 222 changes 
may made while still providing a protein which can be crystallised. 

In one aspect, the changes which may be introduced are changes which introduce residues found 
in the corresponding position in another cytochrome P450 molecule. The corresponding 
25 position may be found by alignment of the other P450 molecule with the sequence of 2C9 wild 
type to maximise homology between the two. The changes may be particularly from another 
cytochrome P450 molecule selected from the group consisting of 2C19, 2C18 and 2C8. 
Example 21 below sets out the production of proteins in which residues from 2C19 are 
substituted into the 2C9 sequence. 

30 

Examples 25 to 27 illustrate homology modelling of the proteins 2C19, 2C1 8 and 2C8 
respectively. The Tables accompanying these examples may be used to identify the residues of 
these proteins which may be substituted into 2C9. 

35 In another aspect, the invention provides a protein which is selected from the group consisting 
of SEQ ID NO:(2x+2), wherein x is an integer from 1 to 52. These proteins all share the 
common feature of the introduction of a proline residue at position 220 or 222 which facilitates 
crystallisation of 2C9. 
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Expression and Recovery of P450 

The 2C9 P450 proteins of the invention are produced by recombinant DNA techniques. The 
nucleic acid sequences which encode wild type P450 proteins are available in the art, and the 
person of skill in the art may use routine methodology, e.g. site-directed mutagenesis, to 
5 introduce coding changes into the nucleic acid sequences so as to provide nucleic acids 
encoding the P450s of the invention. 

Thus in another aspect, the invention provides an isolated nucleic acid encoding a 2C9 P450 
protein of the invention. Nucleic acid includes DNA (including both genomic and cDNA) and 
10 RNA. Nucleic acid of the invention may be single or double stranded polynucleotides. 

Nucleic acids of the invention can be incorporated into a recombinant replicable vector. The 
vector may be used to replicate the nucleic acid in a compatible host cell. Thus in a further 
embodiment, the invention provides a method of making nucleic acids of the invention by 
15 introducing a nucleic acid of the invention into a replicable vector, introducing the vector into a 
compatible host cell, and growing the host cell under conditions which bring about replication of 
the vector. The vector may be recovered from the host cell. 

Preferably, a nucleic acid of the invention in a vector is operably linked to a control sequence 
20 which is capable of providing for the expression of the coding sequence by the host cell, i.e. the 
vector is an expression vector. 

The term "operably linked" refers to a juxtaposition wherein the components described are in a 
relationship permitting them to function in their intended manner. A control sequence "operably 
25 linked" to a coding sequence is ligated in such a way that expression of the coding sequence is 
achieved under condition compatible with the control sequences. 

Suitable vectors can be chosen or constructed, containing appropriate regulatory sequences, 
including promoter sequences, terminator fragments, polyadenylation sequences, enhancer 
30 sequences, marker genes and other sequences as appropriate. Vectors may be plasmids, viral 
e.g. "phage phagemid or baculoviral, cosmids, YACs, BACs, or PACs as appropriate. 

The vectors may be provided with an origin of replication, optionally a promoter for the 
expression of the said polynucleotide and optionally a regulator of the promoter. The vectors 
35 may contain one or more selectable marker genes, for example an ampicillin resistance gene in 
the case of a bacterial plasmid or a neomycin resistance gene for a mammalian vector. Vectors 
may be used in vitro, for example for the production of RNA or used to transfect or transform a 
host cell. Systems for cloning and expression of a polypeptide in a variety of different host cells 
are well known. 
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Promoters and other expression regulation signals may be selected to be compatible with the 
host cell for which the expression vector is designed. For example, bacterial promoters include 
the lacZ promoter, yeast promoters include S. cerevisiae GAL4 and ADH promoters, S. pombe 
nmtl and adh promoter, and mammalian promoters include the metallothionein promoter, the 
SV40 large T antigen promoter or adenovirus promoters. 

For further details see, for example, Molecular Cloning: a Laboratory Manual: 3rd edition, 
Sambrook et al., 2001, Cold Spring Harbor Laboratory Press. Many known techniques and 
protocols for manipulation of nucleic acid, for example in preparation of nucleic acid constructs, 
mutagenesis, sequencing, introduction of DNA into cells and gene expression, and analysis of 
proteins, are described in detail in Current Protocols in Molecular Biology, Ausubel et al. eds., 
John Wiley & Sons, 1992. 

A further embodiment of the invention provides host cells transformed or transfected with the 
vectors for the replication and expression of nucleic acids of the invention. The cells will be 
chosen to be compatible with the said vector and may for example be bacterial, yeast, insect or 
mammalian. 

The 2C9 P450 proteins of the invention may be expressed in any suitable host cell, which a 
person of skill in the art wishes to use as a matter of experimental convenience. Cytochrome 
P450 molecules have been widely expressed in E. coli, and there are numerous vector systems 
for this host cell which may be used. 

Other host cells include yeast, e.g. S. cerevisiae, insect or mammalian, e.g. CHO, cells. 
Expression systems for these and many other host cell types are widely available in the art. 

Host cells may be constructed so that the 2C9 P450 is expressed constitutively, or is induced. 

Once the cells have been cultured to express 2C9 P450, they may be recovered by standard 
techniques available in the art. A convenient means is to recover the cells by low-speed 
centrifugation such that the cells are pelleted intact. 

The process of the present invention is suitable for batch cell culture, and batches of cells from 
1 00 ml to several, e.g. 10 litres can be conveniently handled by current laboratory equipment, 
though larger batches, e.g. 10 to 100 litres, are not excluded. 

This invention also provides a method for expression and recovery of the 2C9 human 
cytochrome P450s of the invention from host cells. This method comprises: 
(a) expressing in a host cell culture said cytochrome 2C9 P450 molecule; 
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(b) recovering said cells from said culture and suspending said cells in salt buffer having a 
conductivity of from 12 to 1 10 mS/cm; 

(c) lysing said cells and removing cell debris to provide a high-salt lysate; 

(d) adding detergent to said lysate (for example 0.015% to 1 .2% v/v) to provide a high-salt- 
5 detergent lysate; 

(e) recovering said P450 from said lysate. 

In a preferred embodiment, the method comprises: 
(a) expressing in E. coli said cytochrome 2C9 P450 molecule; 
10 (b) recovering said cells and suspending them in a 200 mM to 1000 mM salt buffer; 

(c) lysing said cells and removing cell debris to provide a high-salt lysate; 

(d) adding detergent to said lysate (for example 0.015% to 1.2% v/v) to provide a high-salt- 
detergent lysate; 

(e) recovering said P450 from said lysate. 

15 

The recovery step involves affinity purification of the 2C9 P450 from the high salt-detergent 
lysate, since the presence of the high salt rules out the alternative of an ionic exchange 
purification step. 

20 However, once the P450 has been purified by affinity chromatography, the salt must be removed 
in order to allow further purification of the product so that crystallization can be performed. In 
the prior art, salt removal is typically performed by dialysis. However, we have found that this 
process, which removes salt gradually over a period of several hours, causes aggregation and 
denaturation of the P450s and thus is undesirable. We have found that rapid desalting alleviates 

25 this problem to a significant degree. 

Thus in a further aspect, step (e) above may be performed by: 

(e(i)) binding said 2C9 P450 to an affinity support; 

(e(ii)) rinsing said support in a high-salt-detergent wash; 
30 (e(iii)) removing said 2C9 P450 in a high-salt-detergent buffer to provide a P450-high- 

salt-detergent preparation; and 

(f) exchanging the buffer to a low ionic strength buffer without detergent by size- 
exclusion chromatography to provide a P450-low-salt preparation. 

35 The above steps e(i)-(iii) maintain the 2C9 P450 in a high-salt and detergent buffer throughout 
the initial stages of the purification process, which aids the recovery of the P450. 

The preparation may be subject to additional purification and cleaning procedures, such as 
cation exchange chromatography, optionally followed by further size-exclusion chromatography 
40 or hydrophobic interaction chromatography to obtain a more purified preparation of protein. 
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Salt buffer 

This is buffer with a high ionic strength which is used to suspend the cells. It is a buffer 
comprising a salt which is readily soluble to provide a buffer having a conductivity of from 12 
5 to 1 10 mS/cm. Such a buffer is desirably a salt having a concentration in the 200 - 1000 mM 
range. Preferably the salt is a potassium or sodium salt of an anion. Desirably the anion may be 
chloride or phosphate. Potassium phosphate (KPi) is particularly preferred. 

A preferred salt concentration is selected to provide a conductivity of 25 to 35 mS/cm, for 
10 example about 30 mS/cm. A particularly preferred salt concentration is around 500 mM, e.g. 
500 + 50mM. 

The buffer will be maintained at a pH range of from 6.5 to 8.0, preferably from 7.0 to 7.6. The 
buffer may contain other reagents used conventionally in the art for protein purification, such as 
15 glycerol, p-mercaptoethanol, DNase, pH buffering agents, histidine, imidazole and protease 
inhibitors. 

Cell lysis 

Cells may be lysed by physical means, such as sonication or in a French press or continuous 
20 flow cell disrupter, such that the cell walls are broken and the contents of the cells dispersed in 
the salt buffer. To achieve this in a French press, this may be operated at 1 0,000 to 20,000 psi. 

Cell debris is removed (for example by low-speed centrifugation at about 10,000 -25,000 g (e.g. 
about 22,000 g or a short high speed centrifugation to 70000 g; i.e. such that any whole cells are 
25 pelleted but not the membrane fraction). The debris (e.g. pelleted cells) may be subject to a 
further round of lysis, and the debris-free lysate from this further round combined with the 
lysate obtained previously. 

The lysate is then ready to use directly in the next stage of the process, without the need to 
30 isolate a membrane fraction by ultracentrifugation. 

Use of detergent 

Once the lysate has been obtained, it is desirable that the detergent be added to the lysate as soon 
as possible, taking account of the constraints of the experimental set up. This will mean that the 
35 detergent is added to the lysate within 1 hour, preferably within 30 minutes or less of the 
preparation of the debris-free lysate. 

The detergents that may be used are those conventionally used in the art of molecular and cell 
biology for the recovery and processing of biological materials. A large number of different 
40 types of detergents are available for this purpose. Many of these detergents are those of a 
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molecular weight range of from about 350 to 1000, such as from 400 to 800. They include 
anionic surfactants such as cholic acid or salts thereof (e.g. the sodium salt) and deoxycholic 
acid or salts thereof (e.g. the sodium salt) as well as zwitterionic surfactants such as CHAPS (3- 
[(3-cholamidopropyl)dimethylarrunonio]- 1 -propane sulphonate). 

5 

A particularly preferred class of detergents are non-ionic detergents. There are a wide variety of 
non-ionic detergents available in the art. Non-ionic detergents include octyl-p-D- 
glucopyranoside and ethers, such as C2-10 alkylphenol ethers, of polylethylene glycol. Such 
compounds may be of a molecular weight range of 500 - 800 Da, and include Nonident™ P40, 
10 IGEPAL CA630, and Triton™ X-100, and the like, which are commercially available. 

The detergent is added to provide a usual concentration of from 0.015% to 1 .2% v/v of detergent 
in the lysate. The amount of detergent added is preferably in the range of 0.1 to 1.2%, more 
preferably about 0.2 to 0.4%, such as about 0.3%. 

15 

The detergent is added in a volume so that desirably, the concentration of salt or ionic strength 
does not decrease by more than 1 0%. 

Recovery of Purified 2C9P450 
20 We have found that the above high-salt-detergent lysate prepared in accordance with the 

invention provides for the recovery of 2C9 P450 protein in reduced aggregation form at a level 
much higher than experienced to date in the art. As mentioned above, the presence of the high 
salt buffer rules out an immediate ionic exchange chromatography step, but affinity purification 
may be performed as the next step. 

25 

Affinity purification may take the form of providing an affinity support matrix in which a ligand 
for the 2C9 P450 is attached. The support may be a resin, a bead (e.g. glass or polymer such as 
polystyrene), a magnetic bead, or the like. Where the 2C9 P450 contains a tag, the ligand will 
be cognate to the tag, e.g. Ni-NTA for a histidine tag, biotin for a streptavidin tag, etc. The 
30 ligand may also be an antibody, either to an epitope tag such as an HA tag, or to an epitope of 
the 2C9 P450. 

The lysate is brought into contact with the affinity support under conditions for the 2C9 P450 to 
bind to the support. After binding, the support is rinsed. The rinse buffer should be a high-salt- 
35 detergent buffer, which may be the same or different to the lysate buffer. Preferably it is the 
same. If different, it will still have concentrations of salt and detergent as specified above. 

After rinsing, the 2C9 P450 is removed from the support. This may be done in batch or by 
packing the support into a column and eluting the 2C9 P450 using a high-salt-detergent buffer, 
40 which is modified to remove the 2C9 P450 from its ligand. For example, for Ni-NTA, the buffer 
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may contain histidine or imidazole at a sufficient excess concentration to displace the His tag of 
the 2C9 P450. Suitable competitors may be used for other types of tags. 

Desalting 

The ionic strength of the resulting P450 solution is lowered by a rapid desalting process. We 
have found that a size exclusion column may be used for this purpose, with a flow rate such that 
the 2C9 P450 is separated from the high salt concentration within 10-30, preferably within 10 
minutes. The 2C9 P450 is loaded to the column and eluted through the column with a low salt 
buffer. 

While not wishing to be bound by any one particular theory, we have observed that whereas 
gradual desalting by, for example, dialysis, leads to aggregation and denaturation of cytochrome 
2C9 P450, the rapid desalting process reduces aggregation to a significant degree. 

The low salt buffer is preferably a similar salt to the high salt buffer described above, e.g. a 
sodium or potassium salt such as a chloride or phosphate, with potassium phosphate again being 
preferred. By "low salt", it is meant less than 50 mM, preferably less than 20 mM, and 
preferably about 10 mM. At this stage, it is not necessary to maintain detergent in the buffer. 

Further purification 

It is desirable that after the desalting step, the preparation is subject to further purification 
promptly, i.e. without storage or freezing of the sample. This can be achieved by applying the 
desalted eluate directly to a further purification column. If not, the eluate from the desalting 
process is collected and applied within 1 hour to the column. A number of techniques are known 
as such in the art for the further purification or concentration of protein preparations, and 
examples of these are outlined in the accompanying examples. They include weak cation 
exchange columns, such as carboxymethyl-Sepharose™, BioRex™70, carboxymethyl- 
Biogel™, and the like, and strong cation exchange columns such as MonoS, which may be used 
to further remove detergent. For example, the desalted cytochrome P450 may be directly 
applied to a CM Sepharose™ column (e.g. a 5 ml HiTrap column, Pharmacia), previously 
equilibrated with 10 mM KPi, pH 7.4, 20% glycerol, 0.2 - 2.0 mM DTT, 1 mM EDTA ("buffer 
1"). The following step elution protocol may then be run on the AKTA FPLC system; wash with 
10-20 column volumes of buffer 1 and then 10-20 column volumes of lOmM KPi, pH 7.4, 
20% glycerol, 0.2 - 2.0 mM DTT, ImM EDTA, 75 mM KC1 or NaCl in order to remove any 
trace of detergent. The P450 is then eluted with the above latter buffer with KC1 or NaCl 
concentration increased to 500 mM. 

Optionally this is followed by a size exclusion column, e.g. Superose™, Superdex™, 
Sephacryl™, and the like. The protein recovered from either the cation exchange or size 
exclusion step may be concentrated to provide a solution suitable for crystallisation or other use. 
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A concentration of from 20 to 120, e.g. 20 to 80 mg/ml may be achieved by the use of the 
present invention. 

B. Crystallisation of 2C9 Proteins. 

5 A number of methods are known as such in the art for obtaining protein crystals. Conveniently, 
the final protein is concentrated to 10-60, e.g. 20-40 mg/ml in 10-100 mM potassium phosphate 
with high salt (e.g. 500 mM NaCl or KC1) by using concentration devices that are commercially 
available. The protein may be concentrated in presence of 20% glycerol, 2.0 mM DTT and 1 
mMEDTA. 

10 

The protein is crystallized by vapour diffusion at 5-25 °C against a range of buffer 
compositions. Crystals may be prepared using commercially available screening kits such as, 
Polyethylene glycol (PEG)/ion screens, PEG grid, Ammonium sulphate grid, PEG/ammonium 
sulphate grid or the like purchased from Hampton Research, Emerald Biostructure, Molecular 
1 5 Dimension and from others. 

Typically the vapour diffusion buffer comprises 0 - 27.5%, preferably 2.5-27.5% PEG 1K-20 K, 
preferably 1-8K or PEG 2000MME-5000MME, preferably PEG 2000 MME, or 0-10% 
Jeffamine M-600 and/or 5-20%, e.g. 10-20% propanol or 15-20% ethanol or about 15%-30%, 
20 e.g. about 15% 2-methyl-2,4-pentanedi61 (MPD), optionally with 0.01 M -1 .6 M salt or salts 
and/or 0-0.15, e.g. 0-0.1, M of a solution buffer and/or 0-35%, such as 0-15%, glycerol and/or 0- 
35% PEG300-400; but preferably:: , 

10-25% PEG 1K-8K or PEG 2000MME or 0-10% Jeffamine M-600 and/or 5-15%, e.g. 10-15%, 
25 propanol or ethanol, optionally with 0. 1 M -0.2 M salt or salts and/or 0-0. 1 5, e.g. 0-0. 1 M 
solution buffer and/or PEG400, but more preferably: 

15-20% PEG 3350 or PEG 4000 or PEG 2000MME or 0-10% Jeffamine M-600 or 5-15%, e.g. 
10-15% propanol or ethanol, optionally with 0.1 M-0.2 M salt or salts and/or 0-0.15 M solution 
30 buffer. 

Specifically preferred crystallisation conditions for the 2C9 proteins described herein are: 
0.05-0.1 M Tris-HCl pH 8.0-8.8, 0.1-0.2 M Lithium sulphate, 10-15% PEG 4000; 

35 

0.1 M Tris pH 8.0-8.8, 1 5-30% PEG 400, 5% PEG 8000, 10% glycerol; and 



0.1-0.4 M KH 2 P0 4 , 0-25 % PEG 3350, 0-10% glycerol. 
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The salt may be an alkali metal (particularly lithium, sodium and potassium), alkaline earth 
metal (e.g. magnesium or calcium), ammonium, ferric, ferrous or transition metal salt (e.g. zinc) 
of a halide (e.g. bromide, chloride or fluoride), acetate, formate, nitrate, sulphate, tartrate, citrate 
or phosphate. This includes sodium fluoride, potassium fluoride, ammonium fluoride, 
5 ammonium acetate, lithium acetate, magnesium acetate, sodium acetate, potassium acetate, 
calcium acetate, zinc acetate, ammonium chloride, lithium chloride, magnesium chloride, 
potassium chloride, sodium chloride, potassium bromide, magnesium formate, sodium formate, 
potassium formate, ammonium formate, ammonium nitrate, lithium nitrate, potassium nitrate, 
sodium nitrate, ammonium sulphate , potassium sulphate, lithium sulphate, sodium sulphate, di- 

10 sodium tartrate, potassium sodium tartrate, di-ammonium tartrate, potassium dihydrogen 
v phosphate, tri-sodium citrate, tri-potassium citrate, zinc acetate, ferric chloride, 
calcium chloride, magnesium nitrate, magnesium sulphate, sodium dihydrogen phosphate, di- 
sodium hydrogen phosphate, di-potassium hydrogen phosphate, ammonium dihydrogen 
phosphate, di-ammonium hydrogen phosphate, tri-lithium citrate, nickel chloride, ammonium 

15 iodide, di-ammonium hydrogen citrate. 

Solution buffers if present include, for example, Hepes, Tris, imidazole, cacodylate, tri-sodium 
citrate/citric acid, tri-sodium citrate/HCl, acetic acid/sodium acetate, phosphate-citrate, sodium 
potassium phosphate, 2-(N-morpholino)-ethane sulphonic acid/NaOH (MES), CHES, bis- 
20 trispropane, CAPS, potassium dihydrogen phosphate, sodium dihydrogen phosphate, 
dipotassium hydrogen phosphate or disodium hydrogen phosphate. 

The pH range is desirably maintained at pH 4.2-10.5, preferably 4.2-8.5, more preferably 4.7-8.5 
and most preferably 6.5-8.5. 

25 

Crystals may be prepared using a Hampton Research Screening kit, Poly-ethylene glycol 
(PEG)/ion screens, PEG grid, Ammonium sulphate grid, PEG/ammonium sulphate grid or the 
like. 

30 Crystallisation may also be performed in the presence of an inhibitor or substrate of P450, e.g. 
fluoroxamine or 2-phenyl imidazole. 

Additives can be added to a crystallisation condition identified to influence crystallisation. 
Additive Screens are to be used during the optimisation of preliminary crystallisation conditions 
35 where the presence of additives may assist in the crystallisation of the sample and the additives 
may improve the quality of the crystal e.g. Hampton additive Screens which use glycerol, 
polyols and other protein stabilizing agents in protein crystallisation (R. Sousa. Acta. Cryst. 
(1995) D51, 271-277) or divalent cations (Trakhanov, S. and Quiocho, F.A. Protein Science 
(1995) 4,9, 1914-1919). 



40 
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C. Crystals 

In a further aspect, the invention thus provides a crystal of human 2C9 P450 protein molecules, 
and a method of obtaining the crystal structure of a human 2C P450 molecule which comprises 
subjecting said crystal to X-rays, and analysing the diffraction pattern obtained to determine the 
5 3-dimensional coordinates of the atoms of said 2C9 P450. 

Thus the present invention provides a crystal of P450 having the trigonal space group P321 , and 
unit cell dimensions 165.46 A, 165.46 A, 1 1 1.70 A, 90°, 90°, 120°. The crystal contains two 
copies of 2C9 in an asymmetric unit, denominated at A and B in Tables 1, 2, 3 and 8. Unit cell 
10 variability of 5% may be observed in all dimensions. 

Such a crystal may be obtained using the methods described in the accompanying examples. 

The crystal may be of a 2C9 protein which comprises the sequence of SEQ ID NO:2 other the 
1 5 following changes: 

position 220 or position 222 is proline; and 

optionally up to 21, for example up to 10, for example up to 5 other positions are altered, 
the positions being numbered according to wild type 2C9, and include the sequences described 
herein in the accompanying examples; 

20 

The methodology used to provide a P450 crystal illustrated herein may be used generally to 
provide a human P450 crystal resolvable at a resolution of at least 3.1 A and preferably at least 3 
A. 

25 The invention thus further provides a crystal of a P450 protein described herein having a 
resolution of at least 3.1 A and preferably at least 3 A. 

In a further aspect, the invention provides a method for making a protein crystal of a P450 
protein described herein, which method comprises growing a crystal by vapour diffusion using a 

30 reservoir buffer that contains a potassium salt and a PEG precipitate. The growing of the crystal 
is by vapour diffusion and is performed by placing an aliquot of the solution on a cover slip as a 
hanging drop above a well containing the reservoir buffer. Preferably the potassium salt is 
potassium phosphate, particularly 0.05 to 0.2 M potassium phosphate. The PEG precipitate 
concentration is preferably 10-30% PEG (more preferably 10-20% PEG). A higher weight PEG 

35 in the range of PEG 2000 to PEG 4000 may be used. Preferably PEG 3350 is used. The aliquot 
contains protein solution and reservoir buffer, typically in a ratio of 1 part protein solution to 1 
part reservoir buffer. The protein solution was 0.7 mM. Most preferably the reservoir buffer is 
0.2 M dibasic potassium phosphate and 20% PEG 3350. Alternative crystallisation conditions 
comprise (i) 0-0.2 M Tris-HCl (pH 8-9.5, preferably pH 8.4-8.8), 0-20% PEG 400, 0-20% PEG 

40 8000, 0-20% glycerol or (ii) 0-0.2 M Tris-HCl (pH 8-9), 0-0.25 M Li 2 S0 4 , 0-20% PEG 4000; 
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more particularly (iii) 0.1 M Tris-HCl (pH 8.8), 15% PEG 400, 5% PEG 8000, 10% glycerol, 
(iv) 0.1 M Tris-HCl (pH 8.5), 0.2 M Li 2 S0 4 , 15% PEG 4000 or (v) 0.1 M Tris-HCl (pH 8.4), 
15% PEG 400, 5% PEG 8000, 10% glycerol. Condition (iv) is particularly preferred. 

5 A total of 2648 crystallisation wells were set up to obtain a 3.0 A dataset suitable for the 

solution of the first structure. A further 1584 wells were set up to achieve a crystal resolvable to 
2.6 A. This is an indication of the difficulty in obtaining crystals of suitable resolution for 
structure solution. The interpretation of the crystallisation screens and subsequent analysis of 
the results to determine which conditions to be set up, require significant experience. 

10 

Other crystals of the invention include crystals which have selected coordinates of the binding 
pocket, wherein the amino acid residues associated with those selected coordinates are located 
in a protein framework which holds these amino acids in a relative spatial configuration 
corresponding to the spatial configuration of those amino acids in Table 1, 2, 3 or 8. By 
1 5 "corresponding to", it is meant within a r.m.s.d. of less than 2.0 A, preferably less than 1 .5 A, 
more preferably less than 1 .0 A, even more preferably less than 0.64 A and most preferably less 
than 0.5 A. The amino acids which provide the selected coordinates are preferably selected 
from amino acids which form part of the binding pocket of P450, and include those of Table 5 
or 6, or combinations thereof as defined further herein below. 

20 

Crystals of the invention also include crystals of 2C9 mutants and chimeras as defined further 
below in Sections F and G. 

The invention further provides a method of assessing the ability of a compound to interact with 
25 P450 protein which comprises: 

obtaining or synthesising said compound; 

forming a crystallised complex of a P450 protein and said compound, said complex 
diffracting X-rays for the determination of atomic coordinates of said complex to a resolution of 
better than 3. 1 A and preferably at least 3 A; and 
30 analysing said complex by X-ray crystallography to determine the ability of said compound to 
interact with the P450 protein. 

D. Description of Structure. 

The analysis of the crystals obtained in the present invention has allowed a detailed analysis of 
35 the structure of a human P450 molecule. Cytochrome P450 2C9 can be considered to be a two 
domain protein, with a smaller, predominantly beta strand domain and a larger, predominantly 
alpha helical domain, forming an overall triangular arrangement. All P450 structures solved to 
date have the same overall topology, leading to a nomenclature adopted by the literature to 
describe the individual alpha helices and beta strands within P450 structures (see Ravichandran 
40 et al, Science, 1993, 261, 731-736 for definitions). The protein as purified consists of residues 
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19-494 (numbering from full length 2C9), and all but the first and last few of these residues are 
distinguishable in the electron density. The beta strand domain consists of beta sheets 1 and 2 
and alpha helices A and B. These structural elements are formed by the N-terminal region of 
the polypeptide chain (residues 30-90) and residues between the helices K and K\ These 
5 residues, along with the loops between helices B and C, and helices F and G (herein referred to 
as the B-C and F-G loops), are implicated in the interaction of mammalian P450s with the 
membrane when the protein is in its native membranous form. These loops also confer some of 
the reaction specificity to individual P450s and are among the most divergent regions of 
sequence. 

10 

The alpha helical domain consists of helices C through L. The heme moiety is located between 
the alpha helical and the beta strand domains, and sits above helix I (residues 284-3 15). The 
single protein ligand to the heme, cysteine 435, is found in a loop prior to the last alpha helix. 
Given the range of compounds that P450s metabolise, the substrate binding pockets of these 
15 enzymes can accommodate a variety of shapes and sizes. Access to and from the heme group 
may be regulated by the position of the loops that form the substrate binding site, leading to 
open and closed conformations of the enzyme. Mutational and activity data has allowed the 
mapping of regions of sequence to function. 

20 A total of six substrate recognition sites (SRS) have been proposed by Gotoh (Gotoh, J. Biol. 
Chem., 267 (1992), 83-90). Some of the residues that line the binding pocket of the 2C9 
structure include residues within these predicted SRS and include several residues that have 
been linked to changes in both specificity and reaction rates within mutant forms of the protein. 
The regiospecific hydroxylation of warfarin has been linked to polymorphism at residue 359; 

25 which lies above and to one side of the heme group, while residue 1 14 which has been shown to 
change warfarin and diclofenac hydroxylation rates, lies above and to the other side of the heme 
group. 

The structure of the present invention confirms that many of the residues inferred as potential 
30 SRS residues in the prior art by other methods (e.g. sequence alignment and mutagenesis) are 
found in the various SRSs seen in our structure. We have also identified many other residues 
which are likely to provide side chains capable of interacting with many P450 substrates. For 
example, our structure indicates a number of residues, particularly with hydrophobic side chains, 
are in the SRS regions. 

35 

In the embodiments of the invention described herein where selected coordinates of the P450 
structure may be used, the coordinates may include some or all of these residues. 

An overlay of the 2C5 and 2C9 structure indicates that while the gross features of the protein are 
40 largely conserved between the two proteins, there are some interesting differences. The first 
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resolvable residue in the electron density is residue 30 (all numbering is in relation to the full 
length protein), and the last residue is residue 490. Thus there are 10 residues without electron 
density at the N-terminus and the four histidine C-terminal tag is also not resolved. 

5 Starting at the N-terminus, the two proteins adopt the same position at residue 48. Following 
the polypeptide chain back towards the N-terminus, the position of the two sequences is out of 
register by one, and towards the end, two residues, while the backbone trace of the two proteins 
is very close. The sequence identity in this region is particularly high, so such a difference 
seems somewhat surprising. It is probably attributable to the comparatively low resolution of 
10 the 2C5 structure which made accurately assigning the sequence at the N-terminus difficult. 
The higher resolution of the 2C9 structure has made assigning the sequence in this region less 
ambiguous. Thus this structure of 2C9 may be more representative of the true conformation of 
the N-termini of both 2C5 and 2C9. 

15 The first region in which the two proteins differ substantially is the region between the B and C 
helices (residues 99 to 1 1 1). The temperature factors of the chain between residues 99 and 109 
for 2C5 are high (the average B factor for all atoms in this range is 99. 1 A 2 ), implying much 
mobility in this region, and hence little confidence can be placed in their position. In contract, 
the average B-factors for all atoms for residues 99 to 1 1 1 is 55.5 A 2 in 2C9. 

20 

In the 2C9 structure residues 101 to 106 have adopted a helical formation (helix B') that has 
been observed in bacterial P450 structures. These residues form part of SRS 1 , and thus 
contribute to the active site of the P450. The electron density has allowed unambiguous 
interpretation of all side chain positions in this region. A notable feature in this region is Arg97, 

25 which is proposed to be an important cation in the active site (2C9 substrate are predominantly 
acidic). The equivalent residue in 2C5 (Arg97) adopted a different conformation, and as a result 
did not form part of the active site. His99 has been implicated in omeprazole activity (Ibeanu et 
al., (1996), J Biol Chem, Vol. 271, 12496-12501); it is the only residue in SRS 1 not conserved 
between 2C9 and 2C19 (in 2C9 is it a He, in 2C19 a His), and mutation of this residue alone in 

30 2C19 confers omeprazole activity to the resulting mutant protein. The 2C9 structure confirms 
that this residue forms part of the active site. 

In the 2C9 structure the side chain position of Arg97 is clearly resolved, forming an interaction 
with the haem and Vail 13. Phel 14 points into the active site and is well positioned to form pi- 
35 pi stacking interactions with substrates as has been suggested by a number of groups. Phel 10 is 
in close proximity, but not as exposed at Phel 14. 

Argl05 and Argl08, which have also been suggested as potentially contributing to a cation site 
within the active site, both point away from the cavity. 

40 
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The next region of divergence between the 2C5 and 2C9 structures is the region between the F 
and G helices. Residues 212 to 222 inclusive, which form part of the F-G loop, were absent in 
the published 2C5 structure. These residues are well resolved in the 2C9 structure, and form 
two turns of helix (all secondary structure assignment done using the program DSSP (Kabsch 
5 and Sander, Biopolymers 22 ( 1 983) 2577-2637). Residues 220 and 22 1 , while not contributing 
to the active site, clearly do have some impact on the accessibility of the active site, by 
mediating the position of the F-G loop. One of the disadvantages of mapping regions of 
sequence involved in substrate contact is the inability to distinguish between those regions 
which directly contact substrates (by lining the active site) and those that mediate the interaction 
1 0 the substrate has with the P450 by regulating structural elements within the enzyme. The 2C9 
structure will allow the distinction between direct and indirect impact of individual residues on 
substrate specificity and activity. The redesign of compounds to facilitate or remove 
interactions with 2C9 is clearly going to be simplified by this distinction. 

1 5 The residues at positions 286 and 289 have been implicated in substrate specificity (Klose et al., 
(1998), Arch. Biochem. Biophys., Vol. 357, 240-248). Only residue 289 actually lines the 
active site, but both are in close proximity to Phel 1 0 of the B-C loop, and hence their role in 
substrate specificity may be an indirect one via the packing of structural elements, rather than a 
direct one through substrate contact. 

20 

Helices H and I adopt the same spatial conformation in the two proteins; the loop between the 
two helices is three residues longer and is clearly resolved in the electron density. 

Phe476 forms a hydrophobic patch in the active site along with Phel 00, Leul02, Leu208, 
25 Leu362, and Leu366. 

There are 4 other alleles of 2C9 which have currently been identified, which have an amino acid 
substitution. 2C9*2 has R144C, 2C9*3 I359L, 2C9*4 1359T and 2C9*5 D360E. Ile359 does 
not lie in the active site, but is close to Thr305 and Thr361 . It is not easy to envisage a direct 

30 effect of this residue on ability to catalyse compounds, but as has been noted for other residues, 
a mutation here may cause the shift of structural elements, which will impact on the active site. 
A similar effect may be true for Asp360. Arg 144 does not form part of the binding pocket of 
2C9. It has however been widely believed that the variation in drug metabolism properties 
exhibited by those individuals possessing.the 2C9 R144C allele variation is due to a modified 

35 interaction between the P450 and the reductase. The peripheral location of this residue in the 
structure of 2C9 would support this argument. 

Dimer Interface 

The rotation angle between the two copies in the asymmetric unit is not 1 80°, and as a result the 
40 interface between the two copies (here referred to as A and B) is non-symmetrical. The 
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interface involves a number of hydrogen bonds between residues in helix D of molecule A and 
the G-H loop of molecule B, the G-H loop of molecule A and the C-terminus and helix D of 
molecule B, the C terminus of A and the G-H loop of molecule B. 

5 E. Crystal Coordinates. 

In a further aspect, the invention also provides a crystal of P450 having the three dimensional 
atomic coordinates of Table 1, 2, 3 or 8. An advantageous feature of the structure defined by the 
atomic coordinates is that it has a high resolution (about 3 A for Table 1, about 2.6 A for Table 
2, about 3.1 A for Table 3 and about 2.6 A for Table 8). 

10 

Another advantageous feature of the invention in that it provides atomic coordinate data relating 
to the loop between helices F and G (the FG loop). The FG loop is one of the most divergent 
topological regions between the mammalian and bacterial P450 enzymes. As such, it is one of 
the more difficult parts of the mammalian enzymes to model when using a bacterial structure as 

1 5 a modelling template. The structure of P450BM3 (Ravichandran et al, 1 993, ibid) has been 
widely, used within the field as a structural template for modelling the human forms. P450BM3 
has just twelve residues in the FG loop, as opposed to the 21 residues in the 2C isoforms. The 
only mammalian P450 structure in the public domain is that of the rabbit 2C5 isoform, solved 
by X-ray crystallography to a resolution of 3.0 A (Williams et al, Mol Cell (2000), 5, 121-131). 

20 While the 2C5 structure does provide an improved modelling template when compared to the 
bacterial structures, the position of the FG loop was not resolvable in the crystal structure. In 
contrast, the 2C9 structure described here includes the FG loop. Residues within the FG loop 
have not been widely implicated in the substrate selectivity of P450s, and lie outside the 
substrate recognition sites (SRS's) identified by Gotoh (Gotoh, O, J. Biol. Chem, 267; 83-90 

25 (1992)). Residues within the FG loop have been shown to modify the compound binding 
specificity of 2C9 (Tsao et al, Biochemistry (2001), 40, 1937-1944). It was not clear whether 
this effect was due to direct interaction of residues within the FG loop and the compound, or a 
secondary effect caused by the interaction of these residues with residues within the pocket that 
fall within the substrate recognition sites (SRS) of the enzymes. It is now evident from our 

30 structure that the residues of the FG loop do not contribute to the binding pocket. The structure 
of 2C9 will therefore more readily facilitate the identification of direct and indirect interactions 
between compounds and 2C9. 

Another advantageous feature is that the average B-factor of the 2C9 structure is 43.9 A 2 in 
35 contrast to the 2C5 structure which had an overall B-factor of 58.6 A 2 , resulting in a better 
definition for most of the side chains within the structure. This is advantageous for all uses of 
the coordinates, especially in silico work, molecular replacement, and homology modelling. 

A further advantage of the 2C9 structures described herein is that they are unliganded, apo 
40 structures. This makes them particularly suitable for soaking in ligands and hence determining 
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co-complex structures and, are also ideal for homology modelling purposes as there is no 
conformational bias from a ligand. 

The BC and FG loops are among the most varied features of cytochromes P450. Both loops 
5 contribute to the enzymes catalytic cycle; the BC loop directly by providing residues that form 
part of the active site, and mediate specificity and activity interactions, and the FG loop by 
movement allowing substrate entry and exit. In this high resolution 2C9 structure both of these 
loops are well resolved, in contrast to the 2C5 structure. 

10 Tables 1, 2, 3 and 8 give atomic coordinate data for P450 2C9. In Tables 1, 2, 3 and 8 (he third 
column denotes the atom, the fourth the residue type, the fifth the chain identification (either A 
or B), the sixth the residue number (the atom numbering is with respect to the full length wild 
type protein), the seventh, eighth and ninth columns are the X, Y, Z coordinates respectively of 
the atom in question, the tenth column the occupancy of the atom, the eleventh the temperature 

1 5 factor of the atom, the twelfth (where present) the chain identification, and the last the atom 
type. 

The coordinates of Tables 1,2,3 and 8 provide a measure of atomic location in Angstroms, to 3 
decimal places. The coordinates are a relative set of positions that define a shape in three 

20 dimensions, but the skilled person would understand that an entirely different set of coordinates 
having a different origin and/or axes could define a similar or identical shape. Furthermore, the 
skilled person would understand that varying the relative atomic positions of the atoms of the 
structure so that the root mean square deviation of the residue backbone atoms (i.e. the nitrogen- 
carbon-carbon backbone atoms of the protein amino acid residues) is less than 2.0 A, preferably 

25 less than 1 .5 A, more preferably less than 1.0 A, even more preferably less than 0.64 A and most 
preferably less than 0.5 A when superimposed on the coordinates provided in Table 1, 2, 3 or 8 
for the residue backbone atoms, will generally result in a structure which is substantially the 
same as the structure of Table 1, 2, 3 or 8 in terms of both its structural characteristics and 
usefulness for structure-based analysis of P450-interactivity molecular structures. 

30 

Likewise the skilled person would understand that changing the number and/or positions of the 
water molecules and/or substrate molecules of Table 1, 2, 3 or 8 will not generally affect the 
usefulness of the structure for structure-based analysis of P450-interacting structure. Thus for 
the purposes described herein as being aspects of the present invention, it is within the scope of 
35 the invention if: the Table 1, 2, 3 or 8 coordinates are transposed to a different origin and/or 
axes; the relative atomic positions of the atoms of the structure are varied so that the root mean 
square deviation of residue backbone atoms is less than 2.0 A, preferably less than 1 .5 A, more 
preferably less than 1.0 A, even more preferably less than 0.64 A and most preferably less than 
0.5 A when superimposed on the coordinates provided in Table 1, 2, 3 or 8 for the residue 
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backbone atoms; and/or the number and/or positions of water molecules and/or substrate 
molecules is varied. 

Reference herein to the coordinate data of Table 1, 2, 3 or 8 and the like thus includes the 
coordinate data in which one or more individual values of the Table are varied in this way. By 
"root mean square deviation" we mean the square root of the arithmetic mean of the squares of 
the deviations from the mean. 

Thus, for example, varying the atomic positions of the atoms of the structure by up to about 0.5 
A, preferably up to about 0.3 A in any direction will result in a structure which is substantially 
the same as the structure of Table 1, 2, 3 or 8 in terms of both its structural characteristics and 
utility e.g. for molecular structure-based analysis. 

Those of skill in the art will appreciate that in many applications of the invention, it is not 
necessary to utilise all the coordinates of Table 1, 2, 3 or 8, but merely a portion of them. For 
example, as described below, in methods of modelling candidate compounds with P450, 
selected coordinates of 2C9 may be used. 

By "selected coordinates" it is meant for example at least 5, preferably at least 10, more 
preferably at least 50 and even more preferably at least 100, for example at least 500 or at least 
1000 atoms of the 2C9 structure. Likewise, the other applications of the invention described 
herein, including homology modelling and structure solution, and data storage and computer 
assisted manipulation of the coordinates, may also utilise all or a portion of the coordinates (i.e. 
selected coordinates) of Table 1, 2, 3 or 8. The selected coordinates may include or may consist 
of atoms found in the 2C9 P450 binding pocket, as described herein below. 

F. Chimaeras 

The use of chimaeric proteins to achieve desired properties is now common in the scientific 
literature. For example, Sieber et al (Nature Biotechnology (2001) 19, 456-460) produced 
hybrids between human cytochrome P450 isoform 1 A2 and the bacterial P450 BM3, in order to 
make proteins with the specificity of 1 A2, but which had desirable expression and solubility 
properties of BM3. Active site chimaeras are also described: for example, Swairjo et al 
(Biochemistry (1998) 37, 10928-10936) made loop chimaeras of HIV- 1 and HIV-2 protease to 
try to understand determinants of inhibitor-binding specificity. 

Of particular relevance are cases where the active site is modified so as to provide a surrogate 
system to obtain structural information. Thus Ikuta et al (J Biol Chem (2001) 276, 27548- 
27554) modified the active site of cdk2, for which they could obtain structural data, to resemble 
that of cdk4, for which no X-ray structure is currently available. In this way they were able to 
obtain protein/Iigand structures from the chimaeric protein which were useful in cdk4 inhibitor 
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design. In a similar way, based on comparison of primary sequences of highly related isoforms 
(such as 2C 19 or even 2D6), the active site of the 2C9 protein could be modified to resemble 
those isoforms. Protein structures or protein/ligand structures of the chimaeric proteins could be 
used in structure-based alteration of the metabolism of compounds which are substrates of that 
related P450 isoform. 

Even if the percentage of the amino acid sequence identity between mammalian P450 ranks 
from 20 to 80%, the overall folding of mammalian P450s is expected to be very similar, with the 
same spatial distribution of the structural elements. Furthermore, this class of enzymes exhibits 
distinct substrate specificities that rely on only a limited number of residues located in non- 
contiguous parts of the polypeptide chain. The substrate-binding pocket of P450 is generally 
constituted by residues that fall in the SRS regions (substrate recognition sites) defined by 
Gotoh (Gotoh, 0, J. Biol. Chem, 267; 83-90 (1992)) and in loops of the molecule. 

Aspects of the present invention therefore relate to modification of P450 proteins such that the 
active sites mimic those of related isoforms. For example, from a knowledge of the structure 
and residues of the active site of the human 2C9 protein described herein, and that of the rabbit 
2C5 protein published previously, a person skilled in the art could modify the 2C5 protein such 
that the active site mimicked that of human 2C9. This protein could then be used to obtain 
information on compound binding through the determination of protein/ligand complex 
structures using the chimaeric 2C5 protein. w 

For example, in one aspect the present invention provides a chimaeric protein having a binding 
cavity which provides a substrate specificity substantially identical to that of P450 2C9 protein, 
wherein the chimaeric protein binding cavity is lined by a plurality of atoms which correspond 
to selected P450 2C9 atoms lining the P450 2C9 binding cavity, the relative positions of the 
plurality of atoms corresponding to the relative positions, as defined by Table 1, 2, 3 or 8, of the 
selected P450 2C9 atoms. 

It is possible to postulate that only few changes would be required to inter-convert the substrate 
specificities of P450 isoforms that exhibit more than 70% of amino acid identity. For example, 
2C9 and 2C1 9, although they differ at only 43 of 490 amino acids, exhibits clear substrate 
specificity differences. Using a panel of 2C9/2C1 9 chimaeric proteins, Jung et aL (Jung, F. 
Biochemistry, 37, 16270-16279 (1998)), have identified the sequences differences that confer to 
2C19 a high affinity binding to sulfaphenazole, a very potent and specific inhibitor of 2C9. Site 
directed mutagenesis experiments have revealed that the conversion of 2C 19 to a 2C9-like 
protein was possible by introducing a limited number of substitutions in the 2C19 amino acid 
sequence. These mutations are located in the SRS3 and SRS4 regions of the proteins. Similar 
studies performed by Klose et al. (Arch. Biochem. Biophys. 357, 240-248 (1998)) and Tsao et 
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al. (Biochemistry, 40, 1937-1944, (2001)) have demonstrated the feasibility of the transfer of 
substrate specificities between 2C9 and 2C19 by mutating SRS regions. 



The substrate specificity of an enzyme generally relies on only a limited number of residues 
5 located in non-contiguous parts of the polypeptide chain. The substrate specificities of these 
isoforms could be analysed by substituting these residues by site-directed mutagenesis. The 
minimal changes that would be required to convert another protein into a 2C9-Iike chimera 
could be at least two amino acids selected from Table 4. These mutations can be introduced by 
site-directed mutagenesis e.g. using a Stratagene QuikChange™ Site-Directed Mutagenesis Kit 
10 or cassette mutagenesis methods (Ausubel, F.M., Brent, R., Kingston, R.E. et al. editors. Current 
Protocols in Molecular Biology. John Wiley & Sons, Inc., New York, Sambrook, J., Fritsch, 
E.F., and Maniatis, T. (1989). Molecular Cloning: a Laboratory Manual. 2nd ed. Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY.). Thus the invention provides a chimaeric 
protein having one or more binding pockets defined by the residues of Table 4. 

15 ' 

This strategy could clearly be applied for proteins that exhibit high sequence homology with or 
without overlapping substrate specificities and from different species. The rabbit 2C5 and the 
human 2C9 and 2C19 P45Qs have been reported to be involved in the metabolism of 
progesterone with different rates, the rabbit isoform being clearly the most efficient enzyme. 
20 The use of the crystal structures solved for 2C5 and 2C9 would allow the characterization of the 
binding mode of the progesterone molecule in the substrate pocket of these proteins. This in 
turn would allow the identification of residues to be modified in the human isofoims to convert 
them into efficient progesterone metabolising enzymes. 

25 In one embodiment, a chimaeric 2C9 enzyme is produced which is isoformal with another 

enzyme of the 2C subfamily. For example, 2C9 could be turned into a 2C1 9-like isoform with a 
few amino acid changes. Based on the information available from the literature on the 
structure/activity studies performed on the 2C9 and 2C19 isoforms, and the analysis of the 
structure of the human 2C9, we postulate that the 2C9 protein could be converted to a 2C1 9-like 

30 protein with the substrate specificities attributed to 2C 1 9. 
The residues to be mutated are one or more of: 

Substitute SRS 1 of 2C9 with SRS 1 of 2C19 (the amino acid change introduced is I99H); 
and/or 

Substitute SRS 3 of 2C9 with SRS 3 of 2C19 (the amino acid changes introduced are V237L 
35 and K241E); and/or 

Substitute SRS 4 of 2C9 with SRS 4 of 2C19 (the amino acid changes introduced are S286N, 
E288V, N289I, V292A and F295L - the key changes could be S286N, N289I, V292A and 
F295L); and/or 

Move SRS 5 of 2C19 to 2C9 (the amino acid L362I is introduced). 
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The minimal changes that would be required to convert 2C9 to 2C19 could be I99H, K241E, 
S286N, N289I, V292A, F295L and L362I and more likely to be I99H, S286N, N289I, V292A, 
and F295L. These mutations can be introduced by site-directed mutagenesis or cassette 
mutagenesis methods, as described herein. 

5 

A 2C19-like chimera can also be made by making the following changes: I99H, S286N, E288V, 
N289I, V292A, F295L. An alternative minimal change would be I99H, S286N, N289I. 

The crystallization of such chimeras and the determination of the three-dimensional structures 
10 relies on the ability of our 2C9 proteins to yield crystals that diffract at high resolution. The aim 
is to modify the inside part or 2C9 to produce a new substrate binding site of 2C19 without 
modifying the outside shell of the proteins that allow the protein to crystallize. 

G. Homology Modelling, 

15 The invention also provides a means for homology modelling of other proteins (referred to 
below as target P450 proteins). By "homology modelling", it is meant the prediction of related 
P450 structures based either on x-ray crystallographic data or computer-assisted de novo 
prediction of structure, based upon manipulation of the 'coordinate data of Table 1, 2, 3 or 8. 

20 The P450 structure set out in Table 1, 2, 3 or 8 is, as explained in further detail herein, a dimer 
structure. The various in silico modelling techniques described in this section and in the other 
sections of this application may utilize either the dimer structure of Table 1, 2, 3 or 8, or either 
of the subunits A and B. To avoid unnecessary repetition, reference is made herein to the 
coordinate data of Table 1 , 2, 3 or 8, but this will be understood to mean either the data for both 

25 subunits or just one of the subunits. 

"Homology modelling" extends to target P450 proteins which are analogues or homologues of 
the 2C9 P450 protein whose structure has been determined in the accompanying examples. It 
also extends to P450 protein mutants of 2C9 protein itself. 

30 

In general, the method involves comparing the amino acid sequences of the 2C9 P450 protein of 
Table 1, 2, 3 or 8 with a target P450 protein by aligning the amino acid sequences. Amino acids 
in the sequences are then compared and groups of amino acids that are homologous 
(conveniently referred to as "corresponding regions") are grouped together. This method detects 
35 conserved regions of the polypeptides and accounts for amino acid insertions or deletions. 

Homology between amino acid sequences can be determined using commercially available 
algorithms. The programs BLAST, gapped BLAST, BLASTN, PS1-BLAST and BLAST 2 
sequences (provided by the National Center for Biotechnology Information) are widely used in 
40 the art for this purpose, and can align homologous regions of two amino acid sequences. These 
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may be used with default parameters to determine the degree of homology between the amino 
acid sequence of the Table 1, 2, 3 or 8 protein and other target P450 proteins which are to be 
modelled. 

5 Analogues are defined as proteins with similar three-dimensional structures and/or functions and 
little evidence of a common ancestor at a sequence level. 



Homologues are defined as proteins with evidence of a common ancestor i.e. likely to be the 
result of evolutionary divergence and are divided into remote, medium and close sub-divisions 
10 based on the degree (usually expressed as a percentage) of sequence identity. 

A homologue is defined here as a protein with at least 15% sequence identity or which has at 
least one functional domain, which is characteristic of 2C9. This includes polymorphic forms of 
2C9. 

15 

There are two types of homologue: orthologues and paralogues. Orthologues are defined as 
homologous genes in different organisms, i.e. the genes share a common ancestor coincident 
. with the? speciation event that generated them. Paralogues are defined as homologous genes in 
the same organism derived from a gene/chromosome/genome duplication, i.e. the common 
20 ancestor of the genes occurred since the last speciation event. 

A mutant is a 2C9 characterized by replacement or deletion of at least one amino acid from the 
wild type 2C9. Such a mutant may be prepared for example by site-specific mutagenesis, or 
incorporation of natural or unnatural amino acids. 

25 

The present invention contemplates "mutants" wherein a "mutant" refers to a polypeptide which 
is obtained by.replacing at least one amino acid residue in a native or synthetic 2C9 with a 
different amino acid residue and/or by adding and/or deleting amino acid residues within the 
native polypeptide or at the N- and/or C-terminus of a polypeptide corresponding to 2C9 and 

30 which has substantially the same three-dimensional structure as 2C9 from which it is derived. 
By having substantially the same three-dimensional structure is meant having a set of atomic 
structure coordinates that have a root mean square deviation (r.m.s.d.) of less than or equal to 
about 2.0 A when superimposed with the atomic structure coordinates of the 2C9 from which 
the mutant is derived when at least about 50% to 100% of the C a atoms of the 2C9 are included 

35 in the superposition. A mutant may have, but need not have, enzymatic or catalytic activity. 

To produce homologues or mutants, amino acids present in the said protein can be replaced by 
other amino acids having similar properties, for example hydrophobicity, hydrophobic moment, 
antigenicity, propensity to form or break a-helical or (3-sheet structures, and so. Substitutional 
40 variants of a protein are those in which at least one amino acid in the protein sequence has been 
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removed and a different residue inserted in its place. Amino acid substitutions are typically of 
single residues but may be clustered depending on functional constraints e.g. at a crystal contact. 
Preferably amino acid substitutions will comprise conservative amino acid substitutions. 
Insertional amino acid variants are those in which one or more amino acids are introduced. This 
5 can be amino-terminal and/or carboxy-terminal fusion as well as intrasequence. Examples of 
amino-terminal and/or carboxy-terminal fusions are affinity tags, MBP tag, and epitope tags. 



Amino acid substitutions, deletions and additions which do not significantly interfere with the 
three-dimensional structure of the 2C9 will depend, in part, on the region of the 2C9 where the 
1 0 substitution, addition or deletion occurs. In highly variable regions of the molecule, non- 
conservative substitutions as well as conservative substitutions may be tolerated without 
significantly disrupting the three-dimensional structure of the molecule. In highly conserved 
regions, or regions containing significant secondary structure, conservative amino acid 
substitutions are preferred. 

15 

Conservative amino acid substitutions are well-known in the art, and include substitutions made 
on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity and/or 
the amphipathic nature of the amino acid residues involved. For example, negatively charged 
amino acids include aspartic acid and glutamic acid; positively charged amino acids include 
20 lysine and arginine; amino acids with uncharged polar head groups having similar hydrophilicity 
values include the following: leucine, isoleucine, valine; glycine, alanine; asparagine, glutamine; 
serine, threonine; phenylalanine, tyrosine. Other conservative amino acid substitutions are well 
known in the art. 

25 In some instances, it may be particularly advantageous or convenient to substitute, delete and/or 
add amino acid residues to a 2C9 binding pocket or catalytic residue in order to provide 
convenient cloning sites in cDNA encoding the polypeptide, to aid in purification of the 
polypeptide, etc. Such substitutions, deletions and/or additions which do not substantially alter 
the three dimensional structure of 2C9 will be apparent to those having skills in the art. 

30 

It should be noted that the mutants contemplated herein need not exhibit enzymatic activity. 
Indeed, amino acid substitutions, additions or deletions that interfere with the catalytic activity 
of the 2C9 but which do not significantly alter the three-dimensional structure of the catalytic 
region are specifically contemplated by the invention. Such crystalline polypeptides, or the 
35 atomic structure coordinates obtained there from, can be used to identify compounds that bind to 
the protein. 

Once the amino acid sequences of the polypeptides with known and unknown structures are 
aligned, the structures of the conserved amino acids in a computer representation of the 
40 polypeptide with known structure are transferred to the corresponding amino acids of the 
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polypeptide whose structure is unknown. For example, a tyrosine in the amino acid sequence of 
known structure may be replaced by a phenylalanine, the corresponding homologous amino acid 
in the amino acid sequence of unknown structure. 

The structures of amino acids located in non-conserved regions may be assigned manually by 
using standard peptide geometries or by molecular simulation techniques, such as molecular 
dynamics. The final step in the process is accomplished by refining the entire structure using 
molecular dynamics and/or energy minimization. 

Homology modelling as such is a technique that is well known to those skilled in the art (see 
e.g. Greer, Science, Vol. 228, (1985), 1055, and Blundell et al, Eur. J. Biochem, Vol. 172, 
(1988), 513). The techniques described in these references, as well as other homology 
modelling techniques generally available in the art, may be used in performing the present 
invention. 

Thus the invention provides a method of homology modelling comprising the steps of: 

(a) aligning a representation of an amino acid sequence of a target P450 protein of unknown 
three-dimensional structure with the amino acid sequence of the P450 of Table 1, 2, 3 or 8 to 
match homologous regions of die amino acid sequences; 

(b) ;modeIling the structure of the matched homologous regions of said target P450 of unknown 
structure on the corresponding regions of the P450 structure as defined by Table 1, 2, 3 or 8; and 

. (c) determining a conformation (e.g. so that favourable interactions are formed within the target 
P450 of unknown structure and/or so that a low energy conformation is formed) for said target 
P450 of unknown structure which substantially preserves the structure of said matched 
homologous regions. 

Preferably one or all of steps (a) to (c) are performed by computer modelling. 

The presence of the FG loop in our structure is particularly advantageous for modelling of other 
P450s especially mammalian P450s, which have longer FG loops than bacterial P450s as there 
is currently nothing known in the art about the conformation of the FG loop in mammalian 
structures. This is advantageous for modelling compounds into this structure or modelled 
structures. 

The data of Table 1, 2, 3 or 8 will be particularly advantageous for homology modelling of other 
human P450 proteins, in particular human P450s such as 2C8, 2C18, 2C19, 2D6, 3A4, 1A1, 
1 A2, 2E1 . These proteins may be the target P450 protein in the method of the invention 
described above. 
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In a particularly preferred aspect, the homology model is selected from the group consisting of 
2C19, 2C1 8 and 2C8. The accompanying examples show a complete homology model for 2C1 9 
and the coordinates of 2C1 8 and 2C8 which may be introduced into the structures of 2C9 or 
2C19 in order to provide a homology model of these proteins. The resulting homology models 
5 may be used in the methods described herein below in sections H, I and J. 

H. Structure Solution 

The structure of the human 2C9 P450 can also be used to solve the crystal structure of other 
target P450 proteins including other crystal forms of 2C9, mutants, co-complexes of 2C9, where 
10 X-ray diffraction data of these target P450 proteins has been generated and requires 
interpretation in order to provide a structure. 

In the case of 2C9, this protein may crystallize in more than one crystal form. The structure 
coordinates of 2C9, or portions thereof, as provided by this invention are particularly useful to 
15 solve the structure of those other crystal forms of 2C9. They may also be used to solve the 

structure of 2C9 mutants, 2C9 co-complexes, or of the crystalline form of any other protein with 
significant amino acid sequence homology to any functional domain of 2C9. 

In the case of other target P450 proteins, particularly the human P450 proteins referred to in 
20 Section D above, the present invention allows the structures of such targets to be obtained more 
readily where raw X-ray diffractibh data is generated. 

Thus, where X-ray crystallographic or NMR spectroscopic data is provided for a target P450 of 
unknown three-dimensional structure, the structure of P450 as defined by Table 1, 2, 3, 8 or 18 
25 may be used to interpret that data to provide a likely structure for the other P450 by techniques 
which are well known in the art, e.g. phasing in the case of X-ray crystallography and assisting 
peak assignments in NMR spectra. 

One method that may be employed for these purposes is molecular replacement. In this method, 
30 the unknown crystal structure, whether it is another crystal form of 2C9, a 2C9 mutant, or a 2C9 
co-complex, or the crystal of a target P450 protein with amino acid sequence homology to any 
functional domain of 2C9, may be determined using the 2C9 structure coordinates of this 
invention as provided herein. This method will provide an accurate structural form for the 
unknown crystal more quickly and efficiently than attempting to determine such information ab 
35 initio. 



Examples of computer programs known in the art for performing molecular replacement are 
CNX (Brunger AT.; Adams P.D.; Rice L.M., Current Opinion in Structural Biology, Volume 8, 
Issue 5, October 1998, Pages 606-61 1 (also commercially available from Accelerys San Diego, 
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CA) or AMORE (Navaza, J. (1994). AMoRe: an automated package for molecular replacement. 
ActaCryst A50, 157-163). 

Thus, in a further aspect of the invention provides a method for determining the structure of a 
5 protein, which method comprises; 

providing the coordinates of Table 1, 2, 3 or 8, and 
positioning the coordinates in the crystal unit cell of said protein so as to provide a structure for 
said protein. 

10 In a preferred aspect of this invention the coordinates are used to solve the structure of target 
P450s particularly homologues of 2C9 for example 2C19, 2C8, 2C18. 

The invention may also be used to assign peaks of NMR spectra of such proteins, by 
manipulation of the data of Table 1 , 2, 3 or 8. 

15 

I. Computer Systems. 

In another aspect, the present invention provides systems, particularly a computer system, the 
systems containing either (a) atomic coordinate data according to Table 1, 2, 3, 8 or 18, said data 
defining the three-dimensional structure of P450 or at least selected coordinates thereof; (b) 

20 structure factor data (where a structure factor comprises the amplitude and phase tifthe 

diffracted wave) for P450, said structure factor data being derivable from the atomic coordinate 
data of Table 1, 2, 3, 8 or 18; (c) atomic coordinate data of a target P450 protein generated by 
homology modelling of the target based on the data of Table 1 , 2, 3, 8 or 1 8; (d) atomic 
coordinate data of a target P450 protein generated by interpreting X-ray crystallographic data or 

25 NMR data by reference to the data of Table 1 , 2, 3 or 1 8; or (e) structure factor data derivable 
from the atomic coordinate data of (c) or (d). 

The atomic coordinate data may be the data of the entire Table or a selected portion thereof 

30 With regard to (c) above, it will be appreciated that Table 1 8 itself is atomic coordinate data of a 
2C 19 obtained by the homology modelling the 2C9 structure of the present invention and the 
data of Table 1 8, and its use, forms a further aspect of the invention. 

The invention also provides such systems containing atomic coordinate data of target P450 
35 proteins wherein such data has been generated according to the methods of the invention 
described herein based on the starting data provided by Table 1 , 2, 3, 8 or 1 8. 

Such data is useful for a number of purposes, including the generation of structures to analyse 
the mechanisms of action of P450 proteins and/or to perform rational drug design of compounds 
40 which interact with P450, such as compounds which are metabolised by P450s. 
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In a further aspect, the present invention provides computer readable storage medium with either 
(a) atomic coordinate data according to Table 1, 2, 3, 8 or 1 8 recorded thereon, said data 
defining the three-dimensional structure of P450, or at least selected coordinates thereof; (b) 
structure factor data for P450 recorded thereon, the structure factor data being derivable from 
the atomic coordinate data of Table 1, 2, 3, 8 or 18; (c) atomic coordinate data of a target P450 
protein generated by homology modelling of the target based on the data of Table 1, 2, 3,8 or 
18; (d) atomic coordinate data of a target P450 protein generated by interpreting X-ray 
crystallographic data or NMR data by reference to the data of Table 1 , 2, 3, 8 or 1 8; or (e) 
structure factor data derivable from the atomic coordinate data of (c) or (d). 

The atomic coordinate data may be the data of the entire Table or a selected portion thereof. 

As used herein, "computer-readable storage medium" refers to any medium or media which can 
be read and accessed directly by a computer. Such media include, but are not limited to: 
magnetic storage media such as floppy discs, hard disc storage medium and magnetic tape; 
optical storage media such as optical discs or CD-ROM; electrical storage media such as RAM 
and ROM; and hybrids of these categories such as magnetic/optical storage media. 

By providing such a storage medium, the atomic coordinate data can be routinely accessed to 
model P450 or selected coordinates thereof. For example, RASMOL (Sayle et al., TIBS, Vol. 
20, (1995), 374) is a publicly available computer software package which allows access and 
analysis of atomic coordinate data for structure determination and/or rational drug design. 

On the other hand, structure factor data, which are derivable from atomic coordinate data (see 
e.g. Blundell et aL, in Protein Crystallography, Academic Press, New York, London and San 
Francisco, (1976)), are particularly useful for calculating e.g. difference Fourier electron density 
maps. 

As used herein, "a computer system" refers to the hardware means, software means and data 
storage means used to analyse the atomic coordinate data of the present invention. The 
minimum hardware means of the computer-based systems of the present invention typically 
comprises a central processing unit (CPU), a working memory and data storage means, and e.g. 
input means, output means etc. Desirably a monitor is provided to visualize structure data. The 
data storage means may be RAM or means for accessing computer readable media of the 
invention. Examples of such systems are microcomputer workstations available from Silicon 
Graphics Incorporated and Sun Microsystems running Unix based, Windows NT or IBM OS/2 
operating systems. 
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In another aspect, the invention provides a computer-readable storage medium, comprising a 
data storage material encoded with computer readable data, wherein the data are defined by all 
or a portion (i.e. selected coordinates as defined herein) of the structure coordinates of 2C9 of 
Table 1, 2, 3 or 8, or a homologue of 2C9 including the structure of 2C19 of Table 18, wherein 
said homologue comprises backbone atoms that have a root mean square deviation from the 
backbone atoms (nitrogen-carbon^-carbon) of Table 1, 2, 3 or 8 of not more than 2.0 A 
(preferably not more than 1 .5 A). 

The invention also provides a computer-readable data storage medium comprising a data storage 
material encoded with a first set of computer-readable data comprising a Fourier transform of at 
least a portion (i.e. selected coordinates as defined herein) of the structural coordinates for 2C9 
according to Table 1, 2, 3 or 8 or 2C 19 of Table 18; which, when combined with a second set of 
machine readable data comprising an X-ray diffraction pattern of a molecule or molecular 
complex of unknown structure, using a machine programmed with the instructions for using said 
first set of data and said second set of data, can determine at least a portion of the structure 
coordinates corresponding to the second set of machine readable data. 

A further aspect of the invention provides a method of providing data- for generating structures 
and/or performing drug design with 2C9, 2C9 homologues or analogues, complexes of 2C9 with 
a compound j or complexes of 2C9 homologues or analogues with compounds, the method 
comprising: \ : ; : 

(i) establishing communication with a remote device containing computer-readable data 
comprising at least one of: (a) atomic coordinate data according to Table 1, 2, 3 or 8, said data 
defining the three-dimensional structure of 2C9, at least one sub-domain of the three- 
dimensional structure of 2C9, or the coordinates of a plurality of atoms of 2C9; (b) structure 
factor data for 2C9, said structure factor data being derivable from the atomic coordinate data of 
Table 1, 2, 3 or 8; (c) atomic coordinate data of a target 2C9 homologue or analogue generated 
by homology modelling of the target based on the data of Table 1, 2, 3 or 8, such as the data of 
Table 18; (d) atomic coordinate data of a protein generated by interpreting X-ray 
crystallographic data or NMR data by reference to the data of Table 1, 2, 3 or 8; and (e) 
structure factor data derivable from the atomic coordinate data of (c) or (d); and 

(ii) receiving said computer-readable data from said remote device. 

Thus another aspect of the invention provides a method of providing data for generating 
structures and/or performing drug design with 2C19, 2C19 homologues or analogues, 
complexes of 2C19 with a compound, or complexes of 2C19 homologues or analogues with 
compounds, the method comprising: 

(i) establishing communication with a remote device containing computer-readable data 
comprising at least one of: (a) atomic coordinate data according to Table 18, said data defining 
the three-dimensional structure of 2C19, at least one sub-domain of the three-dimensional 
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structure of 2C1 9, or the coordinates of a plurality of atoms of 2C19; (b) structure factor data for 
2C19, said structure factor data being derivable from the atomic coordinate data of Table 18; (c) 
atomic coordinate data of a target 2C19 homologue or analogue generated by homology 
modelling of the target based on the data of Table 1 8; (d) atomic coordinate data of a protein 
5 generated by interpreting X-ray crystallographic data or NMR data by reference to the data of 
Table 1 8; and (e) structure factor data derivable from the atomic coordinate data of (c) or (d); 
and 

(ii) receiving said computer-readable data from said remote device. 

10 Thus the remote device may comprise e.g. a computer system or a computer-readable storage 
medium of one of the previous aspects of the invention. The device may be in a different 
country or jurisdiction from where the computer-readable data is received. 

The communication may be via the internet, intranet, e-mail etc. Typically the communication 
15 will be electronic in nature, but some or all of the communication pathway may be optical, for 
example, over optical fibres. 

J. Uses of the Structures of the Invention. 

The crystal structures obtained according to the present invention (including the structures of 
20 Table 1 , 2, 3, 8 and 1 8 as well the structures of target P450 proteins obtained in accordance with 
the methods described herein) may be used in several ways for drug design. For example, many 
drugs or drag candidates fail to be of clinical use due to the detrimental interactions with P450 
proteins, resulting in a rapid clearance of the drugs from the body. The present invention will 
allow those of skill in the art to attempt to rescue such compounds from development by 
25 following these structure-based chemical strategies. 

In the case where a drug molecule is being metabolised by a P450, information on the binding 
orientation by either co-crystallization, soaking or computationally docking the binding 
orientation of the drug in the binding pocket can be determined. This will guide specific 
30 modifications to the chemical structure designed to mediate or control the interaction of the drug 
with the protein. Such modifications can be designed with an aim of reducing the metabolism of 
the drug by P450 and so of improving its therapeutic action. 

The crystal structure could also be useful to understand drug-drug interactions. Many examples 
35 exist where adverse reactions to drugs are recorded if administered while the patient is already 
taking other medicines. The mechanism behind this detrimental and often dangerous drug-drug 
interaction scenario may be when one drug behaves as an inhibitor of a P450 resulting in toxic 
levels of the other drug building-up due to less or no metabolism occurring. The crystal structure 
of the present invention complexed to such an inhibitor (either in vitro or in silico) may also 
40 allow rational modifications either to modify the inhibitor such that it no longer inhibits or 
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inhibits less, or to modify the second drug such that it could bind better to the P450 (so 
becoming metabolised) and so displace the inhibitor. 

P450s display significant polymorphic variations dependent on ethnic origin of the patient. This 
5 can manifest itself in adverse reactions from some segments of patient populations to some 
drugs. By using the crystal structures of the present invention to map the relevant mutation with 
respect to the binding mode of the drug, chemical modifications could also be made to the drug 
to avoid interactions with the variable region of the protein. This would ensure more consistent 
therapeutic value from the drug for such segments of the population and avoid dangerous side- 
10 effects. 

Some pharmaceutical compounds are converted by P450s into active metabolites. In the case of 
such compounds, a greater understanding of how such compounds are converted by a P450 will 
allow modification of the compound so that it can be converted at a different rate. For example, 
15 increasing the rate of conversion may allow a more rapid delivery of a desired therapeutic effect, 
whereas decreasing the rate of conversion may allow for higher doses to be administered or the 
development of sustained release pharmaceutical preparations, for example comprising a 
mixture of compounds which are metabolised at different rates to form the same active 
metabolite. 

20 ; . ; 

Thus, the determination of the three-dimensional structure of P450 provides a basis for the 
design of new compounds which interact with P450 in novel ways. For example, knowing the 
three-dimensional structure of P450, computer modelling programs may be used to design 
different molecules expected to interact with possible or confirmed active sites, such as binding 
25 sites or other structural or functional features of P450. 

(i) Obtaining and analysing crystal complexes. 

In one approach, the structure of a compound bound to a P450 may be determined by 
experiment. This will provide a starting point in the analysis of the compound bound to P450, 
30 thus providing those of skill in the art with a detailed insight as to how that particular compound 
interacts with P450 and the mechanism by which it is metabolised. 

Many of the techniques and approaches to structure-based drug design described above rely at 
some stage on X-ray analysis to identify the binding position of a ligand in a ligand-protein 
35 complex. A common way of doing this is to perform X-ray crystallography on the complex, 
produce a difference Fourier electron density map, and associate a particular pattern of electron 
density with the ligand. However, in order to produce the map (as explained e.g. by Blundell et 
al., mentioned above) it is necessary to know beforehand the protein 3D structure (or at least the 
protein structure factors). Therefore, determination of the P450 structure also allows production 
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of difference Fourier electron density maps of P450-compound complexes and determination of 
the binding position of a drug, and hence may greatly assist the process of rational drug design. 

Accordingly, the invention provides a method for determining the structure of a compound 
5 bound to P450, said method comprising: 

providing a crystal of 2C9 P450 according to the invention; 
soaking the crystal with said compounds; and 

determining the structure of said 2C9 P450 compound complex by employing the data of 
Table 1,2, 3, 8 or 18. 

10 

Alternatively, the P450 and compound may be co-crystallized. Thus the invention provides a 
method for determining the structure of a compound bound to P450, said method comprising; 
mixing the protein with the compound(s), crystallizing the protein-compound(s) complex; and 
determining the structure of said P450-compound(s) complex by reference to the data of Table 
15 1,2, 3, 8 or 18. 

The analysis of such structures may employ (i) X-ray crystallographic diffraction data from the 
: ^ - complex and (ii) a three-dimensional structure of P450, or at least selected coordinates thereof, 

to generate a difference Fourier electron density map of the complex, the three-dimensional 
. 20 structure being defined by atomic coordinate data according to Table 1 , 2, 3 or 8. The 
: . difference Fourier electron density map may then be analysed. 

Therefore, such complexes can be crystallized and analysed using X-ray diffraction methods, 
e.g. according to the approach described by Greer et al., J. of Medicinal Chemistry, Vol. 37, 
25 (1994), 1035-1054, and difference Fourier electron density maps can be calculated based on X- 
ray diffraction patterns of soaked or co-crystallized P450 and the solved structure of 
uncomplexed P450. These maps can then be analysed e.g. to determine whether and where a 
particular compound binds to P450 and/or changes the conformation of P450. 

30 Electron density maps can be calculated using programs such as those from the CCP4 

computing package (Collaborative Computational Project 4. The CCP4 Suite: Programs for 
Protein Crystallography, Acta Crystallographica, D50, (1994), 760-763.). For map 
visualization and model building programs such as "O" (Jones et al., Acta Crystallographica, 
A47, (1991), 110-119) can be used. 

35 

In addition, in accordance with this invention, 2C9 mutants may be crystallized in co-complex 
with known 2C9 substrates or inhibitors or novel compounds. The crystal structures of a series 
of such complexes may then be solved by molecular replacement and compared with that of the 
2C9 of Table 1, 2, 3 or 8. Potential sites for modification within the various binding sites of the 
40 enzyme may thus be identified. This information provides an additional tool for determining the 
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most efficient binding interactions, for example, increased hydrophobic interactions, between 
2C9 and a chemical entity or compound. 

For example there are alleles of 2C9, which differ from the native 2C9 by only 1 or 2 amino 
5 acid substitutions, and yet individuals who express these allelic variants may exhibit very 

different drug metabolism profiles. By generating these allelic proteins and determining the co- 
complex with compounds a greater understanding of allelic interactions with compounds may be 
developed, 

1 0 All of the complexes referred to above may be studied using well-known X-ray diffraction 

techniques and may be refined against 1.5 to 3.5 A resolution X-ray data to an R value of about 
0.30 or less using computer software, such as CNX (mentioned above) X-PLOR (Yale 
University, ©1992, distributed by Accelerys - also see, e.g., Blundell et al; Methods in 
Enzymology, vol. 1 14 & 1 1 5, H. W. Wyckoff et al., eds., Academic Press (1985) (23)). 

15 

This information may thus be used to optimise known classes of 2C9 substrates or inhibitors, 
and more importantly, to design and synthesize novel classes of 2C9 inhibitors and design drugs 
with modified P450 metabolism. ; ■ 

20 (ii) In silico analysis and design. 

Although the invention will facilitate the determination of actual crystal structures comprising a 
P450 and a compound which interacts with the P450, current computational techniques provide 
a powerful alternative to the need to generate such crystals and generate and analyse diffraction 
date. Accordingly, a particularly preferred aspect of the invention relates to in silico methods 

25 directed to the analysis and development of compounds which interact with P450 structures of 
the present invention. 

Thus as a result of the determination of the P450 three-dimensional structure, more purely 
computational techniques for rational drug design may also be used to design structures whose 
30 interaction with P450 is better understood (for an overview of these techniques see e.g. Walters 
et al (Drug Discovery Today, Vol.3, No.4, (1998), 160-178). For example, automated ligand- 
receptor docking programs (discussed e.g. by Jones et al. in Current Opinion in Biotechnology, 
Vol.6, (1995), 652-656) which require accurate information on the atomic coordinates of target 
receptors may be used. 

35 

The aspects of the invention described herein which utilize the P450 structure in silico may be 
equally applied to both the 2C9 structure of Table 1, 2, 3 or 8 and the models of target P450 
proteins obtained by other aspects of the invention. Thus having determined a conformation of a 
P450 by the method described above, such a conformation may be used in a computer-based 
40 method of rational drug design as described herein. In addition the availability of the structure 
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of the P450 2C9 will allow the generation of highly predictive pharmacophore models for 
virtual library screening or compound design. 

Accordingly, the invention provides a computer-based method for the analysis of the interaction 
5 of a molecular structure with a P450 structure of the invention, which comprises: 
providing the structure of a P450 of the invention; 
providing a molecular structure to be fitted to said P450 structure; and 
fitting the molecular structure to the P450 structure. 

1 0 The P450 structure of the invention may be the structure of any one of Table 1 , 2, 3, 8 or 1 8 or 
selected coordinates thereof. 

In an alternative aspect, the method of the invention may utilize the coordinates of atoms of 
interest of the P450 which are in the vicinity of . a putative molecular structure binding region in 
15 order to model the pocket in which the structure binds. These coordinates may be used to define 
a space which is then analysed "in silico". Thus the invention provides a computer-based 
method for the analysis of molecular structures which comprises: 

providing the coordinates of at least two atoms of a P450 structure of the invention 
("selected coordinates"); 
20 providing a molecular structure to be fitted to said coordinates; and 

fitting the structure to the selected coordinates of the P450. 

In practice, it will be desirable to model a sufficient number of atoms of the P450 as defined by 
the coordinates of Table 1, 2, 3, 8 or 18 which represent a binding pocket. Binding pockets and 
25 other features of the interaction of P450 with co-factor are described in the accompanying 
example. Thus, in this embodiment of the invention, there will preferably be provided the 
coordinates of at least 5, preferably at least 10, more preferably at least 50 and even more 
preferably at least 100 selected atoms such as at least 500 or at least 1000 atoms of the P450 
structure. 

30 

Although every different compound metabolised by P450 may interact with different parts of the 
binding pocket of the protein, the structured this P450 allows the identification of a number of 
particular sites which are likely to be involved in many of the interactions of P450 with a drug 
candidate. The residues are set out in the accompanying example. Thus in this aspect of the 
35 invention, the selected coordinates may comprise coordinates of some or all of these residues. 

In order to provide a three-dimensional structure of compounds to be fitted to a P450 structure 
of the invention, the compound structure may be modelled in three dimensions using 
commercially available software for this purpose or, if its crystal structure is available, the 
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coordinates of the structure may be used to provide a representation of the compound for fitting 
to a P450 structure of the invention. 

By "fitting", it is meant determining by automatic, or semi-automatic means, interactions 
5 between at least one atom of a molecular structure and at least one atom of a P450 structure of 
the invention, and calculating the extent to which such an interaction is stable. Interactions 
include attraction and repulsion, brought about by charge, steric considerations and the like. 
Various computer-based methods for fitting are described further herein. 

10 More specifically, the interaction of a compound with P450 can be examined through the use of 
computer modelling using a docking program such as GRAM, DOCK, or AUTODOCK (see 
Walters et al., Drug Discovery Today, Vol.3, No.4, (1998), 160-178, and Dunbrack et al., 
Folding and Design, 2, (1 997), 27-42). This procedure can include computer fitting of 
compounds to P450 to ascertain how well the shape and the chemical structure of the compound 

15 will bind to the P450. 

Also computer-assisted, manual examination of the active site structure of P450 may be 
performed. The use of programs such as GRID (Goodford, J. Med Chem., 28, (1985), 849-857) 
- a program that determines probable interaction sites between molecules with various 
20 functional groups and an enzyme surface - may also be used to analyse the active site to predict, 
for example, the types of modifications, which will alter the rate of metabolism of a compound. 

Computer programs can be employed to estimate the attraction, repulsion, and steric hindrance 
of the two binding partners (i.e. the P450 and a compound). 

25 

If more than one P450 active site is characterized and a plurality of respective smaller 
compounds are designed or selected, a compound may be formed by linking the respective small 
compounds into a larger compound which maintains the relative positions and orientations of 
the respective compounds at the active sites. The larger compound may be formed as a real 
30 molecule or by computer modelling. 

Detailed structural information can then be obtained about the binding of the compound to P450, 
and in the light of this information adjustments can be made to the structure or functionality of 
the compound, e.g. to alter its interaction with P450. The above steps may be repeated and re- 
35 repeated as necessary. 

As indicated above, molecular structures which may be fitted to the P450 structure of the 
invention include compounds under development as potential pharmaceutical agents. The 
agents may be fitted in order to determine how the action of P450 modifies the agent and to 
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provide a basis for modelling candidate agents which are metabolised at a different rate by a 
P450. 



Molecular structures which may be used in the present invention will usually be compounds 
5 under development for pharmaceutical use. Generally such compounds will be organic 

molecules which are typically from about 100 to 2000 Da, more preferably from about 100 to 
1000 Da in molecular weight. Such compounds include peptides and derivatives thereof, 
steroids, anti-inflammatory drugs, anti-cancer agents, anti-bacterial or antiviral agents, 
neurological agents and the like. In principle, any compound under development in the field of 
10 pharmacy can be used in the present invention in order to facilitate its development or to allow 
further rational drug design to improve its properties. 

A single reductase provides several different isoforms of P450 with the electrons required in the 
catalytical cycle. As such, knowledge of the cytochrome P450 reductase (CPR) binding site on 
15 P450 and its characteristics present a means of altering the rate of catalysis, by mediating the 
P450 CPR interactions. The structure of 2C9 will allow the in silico identification of residues 
important in the P450 - CPR interface. 

(Hi) Analysis and modification of compounds and metabolites 

20 Where the primary metabolite of a potential or actual pharmaceutical compound is known, and 
this metabolite is generated by the action of P450, the structure of the agent and its metabolite 
may both be modelled and compared to each other in order to better determine residues of P450 
which interact with the agent. In any event, the present invention provides a process for 
predicting potential pharmaceutical compounds with a desired activity which are metabolised by 

25 P450 at a rate different from a starting compound having the same desired activity, which 
method comprises: 

fitting a starting compound to a P450 structure of the invention or selected coordinates 
thereof; 

determining or predicting how said compound is metabolised by said P450 structure; and 
30 modifying the compound structure so as to alter the interaction between it and the P450. 

It would be understood by those of skill in the art that modification of the structure will usually 
occur in silico, allowing predictions to be made as to how the modified structure interacts with 
theP450. 

35 

Modification will be those conventional in the art known to the skilled medicinal chemist, and 
will include, for example, substitutions or removal of groups containing residues which interact 
with the amino acid side chain groups of a P450 structure of the invention. For example, the 
replacements may include the addition or removal of groups in order to decrease or increase the 
40 charge of a group in a test compound, the replacement of a charge group with a group of the 
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opposite charge, or the replacement of a hydrophobic group with a hydrophilic group or vice 
versa. It will be understood that these are only examples of the type of substitutions considered 
by medicinal chemists in the development of new pharmaceutical compounds and other 
modifications may be made, depending upon the nature of the starting compound and its 
5 activity. 

Although it is usually desired to alter a compound to prevent its metabolism by P450, or at least 
to reduce the rate at which P450 metabolises the compound, the present invention also includes 
developing compounds which are metabolised more rapidly than a starting compound, for 
1 0 example where such a compound blocks metabolism of another drug. 

Where a potential modified compound has been developed by fitting a starting compound to the 
P450 structure of the invention and predicting from this a modified compound with an altered 
rate of metabolism, the invention further includes the step of synthesizing the modified 
1 5 compound and testing it in a in vivo or in vitro biological system in order to determine its 
activity and/or the rate at which it is metabolised. 

The above-described processes of the invention may be iterated in that the modified compound 
may itself be the basis for further compound design. 

20 - : v : :, : 

(iv) Fragment linking and growing. 

The provision of the crystal structures of the invention will also allow the development of 
compounds which interact with the binding pocket regions of P450s (for example to act as 
inhibitors of a P450) based on a fragment linking or fragment growing approach. 

25 

For example, the binding of one or more molecular fragments can be determined in the protein 
binding pocket by X-ray crystallography. Molecular fragments are typically compounds with a 
molecular weight between 100 and 200 Da. This can then provide a starting point for medicinal 
chemistry to optimise the interactions using a structure-based approach. The fragments can be 

30 combined onto a template or used as the starting point for 'growing out* an inhibitor into other 
pockets of the protein. The fragments can be positioned in the binding pocket of the P450 and 
then 'grown* to fill the space available, exploring the electrostatic, van der Waals or hydrogen- 
bonding interactions that are involved in molecular recognition. The potency of the original 
weakly binding fragment thus can be rapidly improved using iterative structure-based chemical 

35 synthesis. 

At one or more stages in the fragment growing approach, the compound may be synthesized and 
tested in a biological system for its activity. This can be used to guide the further growing out 
of the fragment. 
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Where two fragment-binding regions are identified, a linked fragment approach may be based 
upon attempting to link the two fragments directly, or growing one or both fragments in the 
manner described above in order to obtain a larger, linked structure which may have the desired 
properties. 

5 

(v) Compounds of the invention. 

Where a potential modified compound has been developed by fitting a starting compound to the 
P450 structure of the invention and predicting from this a modified compound with an altered 
rate of metabolism (including a slower, faster or zero rate), the invention further includes the 
10 step of synthesizing the modified compound and testing it in a in vivo or in vitro biological 
system in order to determine its activity and/or the rate at which it is metabolised. 

In another aspect, the invention includes a compound which is identified by the methods of the 
invention described above. 

15 

Following identification of such a compound, it may be manufactured and/or used in the 
preparation, i.e. manufacture or formulation, of a composition such as a medicament, 
pharmaceutical composition or drug. These may be administered to individuals. 

20 Thus, the present invention extends in various aspects not only to a compound as provided by 
the invention, but also a pharmaceutical composition, medicament, drug or other composition 
comprising such a compound e.g. for treatment (which may include preventative treatment) of 
disease; a method comprising administration of such a composition to a patient, e.g. for 
treatment of disease; use of such an inhibitor in the manufacture of a composition for 

25 administration, e.g. for treatment of disease; and a method of making a pharmaceutical 
composition comprising admixing such an inhibitor with a pharmaceutically acceptable 
excipient, vehicle or carrier, and optionally other ingredients. 

Summary of Examples. 
30 The invention is illustrated by the examples, which illustrate the invention as follows: 

Example 1 shows the production of DNA encoding 2C9trunc, 2C9-FGloop, 2C9-FGloop K206E 
and 2C9P220. 

35 Example 2 shows the expression of 2C9P220 and 2C9-FGloop in bacteria and the recovery of 
protein. 

Example 3 shows quality assays of the proteins of example 2. 
40 Example 4 shows crystallisation conditions used to obtain crystals of 2C9-FGloop. 
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Example 5 shows crystallisation conditions used to obtain crystals of 2C9P220. 

Example 6 shows a further production of 2C9-FGloop and the mass spectrometry and activity 
5 data of the recovered protein. 

Example 7 shows the production.of crystals of 2C9-FGloop. 

Example 8 shows the expression and recovery of 2C9-FGloop K206E and the mass 
10 spectrometry and activity data of the recovered protein, plus crystallisation of the protein. 

Example 9 shows the crystallisation and structure analysis of 2C9-FGloop K206E at a 3 A 
resolution, as set out in Table 1 . 

1 5 Example 1 0 shows a further crystallisation of 2C9-FGloop K206E. 

Example 1 1 shows the production of a higher resolution (2. 6 A) structure of 2C9-FGloop K206E 

Example 12 shows the production of a high resolution (3.1 A) structure of 2C9-FGloop. 

20 

Example 13 identifies residues of the P450 binding pocket and describes their use in the practice 
of the present invention. . : ■. - 

Example 14 describes the use of modelling techniques using structures of the invention. 

25 

Example 15 outlines a docking experiment. 
Example 16 shows the refinement of 2C9-FGloop K206E structure. 
30 Example 17 shows the production of further 2C9 proteins. 
Example 18 shows the production of 2C9 proteins. 
Example 19 shows the activity of 2C9 Proteins of the invention. 

35 

Example 20 shows crystallisation of 2C9 proteins. 
Example 2 1 describes 2C9-2C 1 9 Chimeras. 
40 Example 22 shows the production of 2C9-2C19 chimeras. 
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Example 23 shows validation of 2C9-FGloop K206E. 
Example 24 shows the activity of 2C9-2C19 Chimeras. 

5 

Example 25 shows crystallisation of 2C9-2C1 9 chimeric proteins. 
Example 26 shows homology Modelling of 2C 1 9. 
1 0 Example 27 shows homology modelling of 2C 1 8 . 
Example 28 shows homology modelling of 2C8. . 

Example 1: Production of DNA encoding 2C9 proteins. 

15 Summary 

Cytochrome P450 2C9 was targeted for crystallisation. Conversion of this intrinsic membranous 
protein to a more water-soluble form, by removal of the N-terminus trans-membrane domain 
was performed prior to crystallisation. 

20 Several N-terminus truncations, largely described in the literature, have been used to produce N- 
truncated cytochrome I*450s (including 2E1, 2D6, 2B1 and others). However, most of these N- 
terminal truncations failed to produce fully soluble proteins and in most cases, the truncated 
P450s still remained associated with membranes. 

25 The membrane anchor domain MDSLWLVLCLSCLLLLSLWRQSSGRGKL (SEQ ID 
NO:l 13) present in 2C9 (residues 2 to 29) was substituted by a short hydrophilic peptide 
MAKKTSSKGR (SEQ ID NO: 114). The introduction of a highly charged polypeptide at the N- 
terminus of this protein was found to greatly decrease the membrane association of these 
proteins. It has also been found that the nature of the second codon in a lacZ expression system 

30 influences the level of expression (Looman et al, EMBO J., 6;2489-24992, 1987) and here 
alanine at position 2 provided good expression in E. coll 

Cytochrome P450 exhibits a high tendency to form large aggregates. The N-terminal deletion of 
cytochrome P450 has prevented aggregation and reduced polydispersity. This, in turn, facilitates 
35 the crystallisation of these proteins. 

A four histidine tag was inserted at the C-terminus of 2C9 to help purification in high salt 
buffers. 
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Our preliminary results, using conditions from commercially available screening kits, indicated 
that the apo and native N-terminus truncated 2C9, 2C9trunc, did not produce any useful crystals. 
Thus the protein requires further modifications to promote crystallisation, and more importantly 
to promote production of useful crystals. Accordingly, the FG loop of the protein was 
5 considered for modification. 

The design of the modification in the F-G loop was based on the published results on the 
crystallisation of the rabbit cytochrome P450 that indicated that the F and G helices were 
involved in the formation of a crystal contact. We predicted that the relative position of the F-G 

10 loop in the protein 2C9trunc could interfere with the ability of the F and G helices to constitute 
crystal contacts. It was proposed that the F-G loop, longer and more mobile than the counterpart 
found in the bacterial P450 BM3, may be stabilized or conformational^ changed by six amino 
acid substitutions: Ile215Val, Cys216Tyr, Ser220Pro, Pro221 Ala, Ile222Leu and Ile223Leu. In 
the resultant construct, 2C9-FGloop, the position of proline 220 is moved by one residue. The 

15 proline residue, often reported as initiating changes in secondary structure, may induce a 
conformational change in the F-G loop and facilitate the formation of crystal contacts. In the 
generation of the protein 2C9-P220, the proline is moved from position 221 , as seen in 2C9 wild 
type to position 220 as seen in 2C19 wild type. Thus the serine 220 was mutated to proline and 
proline 221 was mutated to threonine. The introduction of these two changes alone was 

20 sufficient to promote crystallisation. A single mutation of S220P, retaining the proline at 221 
was also sufficient to get crystallisation. v 

In the generation of the protein 1424, the proline is moved from position 221, as seen in 2C9 
wild type to position 222. This shows that the proline can be moved one amino acid either side 
25 of 221 to promote successful crystallisation. 

We believe having a proline at 220 or 222, preferably proline 220 is a critical determinant for 
crystallisation of 2C9. In particular it is a critical determinant for obtaining apo crystals of 2C9. 
It is also important for obtaining diffraction quality crystals of 2C9. Residue 221 can be alanine, 
30 or threonine. It can also be proline or serine. 

The mutagenesis of human 2C9 cytochrome P450 was performed by a variety of standard 
recombinant DNA techniques including cassette mutagenesis, site-directed mutagenesis or 
specific cloning protocols. For cassette mutagenesis, complementary oligonucleotides bearing 
35 the mutations were annealed and cloned, using natural restriction sites or sites that have been 
introduced by PCR mutagenesis into the P450 cDNA. The constructs were verified by 
restriction mapping followed by full sequencing. Other techniques are described herein or are 
well known as such to those of skill in the art. 



WO 03/035693 



49 



PCT/GB02/04872 



N-terminal truncation of? 450 

The expression vector pCWOri+, provided by Prof. F. W. Dahlquist, University of Oregon, 
Eugene, Oregon, USA, was used to express the truncated human cytochrome P450s in the E. 
coli strain XL1 Blue (Stratagene). A full-length cDNAs encoding cytochrome P450 2C9 was 
5 used as a template for PCR amplification, engineering the 5' terminus deletion, insertion of 
silent restriction sites and insertion of a four Histidine tag at the C-terminus. 

A NotI restriction site (underlined) was introduced in 2C9 at position 87 by PCR amplification 
using the following 5 * oligonucleotide: 
10 5 '-ATAAGAATGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATC-3 ' 
(SEQIDNO:115). 
The 3' oligonucleotides: 

5 ' -TGCGGTCGACTC AGrGGrGGTGGrGG AC AGG AATG A AGC AG AGCTGGTAG-3 ' 
(SEQ ID NO: 1 16) with a Sail cloning site (underlined) and the four Histidine tag (italics) was 
15 used. A total of 30 cycles at 94 °C for 1 min, 52 °C for 1 min, and 72 °C for 2 min were 

followed by an extension of 10 min at 72 °C. The 1420-bp PCR fragment was double digested 
with Notl/Salland purified by gel agarose elution and extraction. 

The complementary oligonucleotides 

20 5 ' - T ATG GCT AAG AAAACG AGCTCT AAAGGGC-3 1 (SEQ ID NO:l 17) and 
5 ' - GGCC GCCCTTT AGAGCTCGTTTTCTT AGCC A-3 ' (SEQ ID NO:l 18) 
with the Mfe/and NotI overhang restriction sites (underlined) were designed to substitute the 
residues 2-29 of the native N terminus of human cytochrome P450 2C9 by the short 
AKKTSSKGR polypeptide. The oligonucleotides were annealed by mixing 10 fig of each 

25 Oligonucleotide in 1 00 p.1 of water, heating at 100°C for 5 min and slow cooling at room 
temperature. 

The 1420-bp PCR fragment was mixed to the double stranded oligonucleotide and ligated in the 
vector pCWori+, previously digested with Ndel and Sail. An aliquot of the ligation product was 
30 used to transform E. coli XL1 Blue strain to yield the plasmid pCW-2C9trunc that encodes for 
the amino-terminal truncated 2C9. 

The truncated 2C9 was used to make the proteins for further crystallisation experiments. 

35 Construction of 2C9-FGloop 

The plasmid pCW-2C9trunc was used as template for the insertion of six amino acids 
substitutions, Ile215Val, Cys216Tyr, Ser220Pro, Pro221 Ala, Ile222Leu, Ile223Leu in the FG 
loop. pCW-2C9trunc was digested by Ndel and BamHI restriction enzyme and the 579-bp 
corresponding to the 5' terminus of the P450 gene was purified by gel agarose extraction and 

40 elution. A double strand oligonucleotide designed to introduce the six amino acids substitution 
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in the FG loop, was generated by annealing the following complementary oligonucleotides 5'- 
GATCC AGGrCr/jCAATAATTTCCCTGCrcrCCTTCATTATTTC-S' (SEQ ID NO:l 19) and 
5 *- CCGGGA AATAATG4^GG/4 GA GCAGGGAAATTATTGTAGACCYG-l 9 (SEQ JD 
NO:120) with the overhang BamHI and Xmal restriction sites (underlined) and the six mutated 
5 codons (italics). The 579-bp fragment and the double strand oligonucleotide were ligated in the 
vector pC W-2C9trunc, previously digested by Ndel and Xmal. An aliquot of the ligation was 
used to transform XI 1 Blue E. coli and yield the plasmid pCW-2C9-FGloop. 

Construction of 2C9-P220 

10 2C9-P220 is a 2C9trunc mutant carrying the mutations S220P and P221T. This mutant was 
made using the Stratagene Quikchange™ mutagenesis kit (catalogue number #200518), 
according to manufacturers instructions. The Quikchange™ mutagenesis method generates a 
mutated plasmid with staggered nicks and uses Dpnl digestion to remove all parental DNA. 
Reactions were made incorporating 5.0 |iL xlO reaction buffer, 5-50 ng pCW-2C9trunc plasmid 

15 DNA, 1 .0 nL dNTP and 125 ng oligonucleotide primers as follows, with mutated bases shown 
in lowercase and the two amino acid change underlined: 

5' CC AG ATCTGC AATAATTTTcCg^AC ATTGATTACTTCCC 3 ' (SEQIDNO:121) 
5' GGGAAGTAATCAATGATgGtcGgAAAATTATTGCAGATCTGG 3' (SEQ ED NO:122) 
Reactions were made to 50 nL with sterile water, 2.5U Pfu Turbo was then added and the 
20 reaction overlayed with 30 jiL mineral oil. Thermocycling was then carried out as follows: 
95°C, 30 sec (1 cycle), 95°C, 30 sec, 55°C, 1 min, 68°C 13.5 min (18 cycles) and finally a 
holding period at 4°C. A control reaction was also included with water in place of 
oligonucleotide primers. 

25 Following thermocycling 10 U Dpnl was added, under the level of the mineral oil, to each 
reaction. The reactions were then gently mixed followed by centrifugation in a bench top 
microcentrifuge, 1 min, 13,000 jpm and incubated at 37 °C for 3 hr. Digested product (1 [iL) 
was then used to transform 50 |iL competent E. coli XLl-Blue cells. The whole transformation 
as then plated onto Luria agar plates containing 100 |ig/ml carbenicillin, inverted, and incubated 

30 overnight at 37 °C. Colonies were isolated and the plasmid DNA pCW-2C9-P220 isolated and 
sequenced to check for the insertion of the correct mutation. 

Construction of 2C9-FGloop-K206E 

The plasmid pCW-2C9-FGloop was used as a template for the substitution Lys206Glu (where 
35 the numbering is of the full length wild type 2C9, SwissProt: PI 1712, not that of SEQ ID NO:2 
or 4). Primers were designed to lie across the region to be mutated; 
5 -GGAAAAGTTGAATGAAAACATCGAGATTTTGAGCAGCCCCTGG-3 > (SEQ ID 

NO:123) 

5 ' -CC AGGGGCTGCTC AA AATCTCG ATGTTTTC ATTC AACTTTTCC-3 ' (SEQ ID 
40 NO: 124) 
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where the mutated codon is shown in bold. These primers were then used in the protocol for 
Quikchange™ mutagenesis (Stratagene) which is briefly summarised. 
Primers were resuspended to 125 ng/^1 and used in a PCR reaction which elongated around the 
plasmid from the mutagenic primer. The template DNA was then digested using Dpnl, a 

5 methylation specific restriction endonuclease which preferentially degrades the template due to 
its methylation. After Dpnl treatment 1 \d of the resultant sample was transformed into K colt 
XL1 Blue strain. Colonies were picked and sequenced. Plasmids containing the mutation were 
chosen and digested with the restriction endonucleases Ndel and Sail. The Ndel Sail DNA 
fragment corresponding to the coding sequence of the 2C9-FGloop K206E mutant was then sub- 

10 cloned into a pCW vector digested with Ndel and Sail. This served to remove any errors 
incorporated during the PCR phase of the Quickchange mutagenesis. 

Example 2: Expression of 2C9P220 and 2C9-FGloop. 

Bacteria expression 

15 A single ampicillin resistant colony of XL1 blue cells was grown overnight at 37 °C in Terrific 
Broth (TB) with shaking to near saturation and used to inoculate fresh TB media. Bacteria were 
grown to an OD600nm =0.4 in 1 litre of TB broth containing 100 jig/ml of ampicillin at 37 °C at 
185 rpm in 2 litre flask. The haem precursor delta aminolevulinic acid (80 mg/1) was added 30 
min prior to induction with 1 mM isopropyl-p-D-thiogalactopyranoside (IPTG) and the 

20 temperature lowered to 30 °C. The bacterial culture was continued under agitation at 30 °C for 
48 to 72 hours. 

(a) Protein purification 

The cells were pelleted at 10000 g for 10 min and resuspended in a buffer containing 500 mM 
25 KPi, pH 7.4, 20 % glycerol, 1 0 mM mercaptoethanol, 0. 1 % (v/v) of protease inhibitor cocktail 
(Calbiochem), 10 mM imidazole, 0.01 mg/ml DNase 1 and 5 mM MgS0 4 . 

The cells were lysed by passing twice through a Constant Systems Cell Homogeniser at 12000 
psi. The cell debris was then removed by centrifugation at 70000 g at 4 °C for 30 min. 

30 

Detergent 1GEPAL CA630 (Sigma) was added dropwise from a 10% stock solution to the lysate 
at a final concentration of 0.3% (v/v) and the lysate was incubated with previously washed 
NiNTA resin (Qiagen) overnight at 4 °C, using agitation. The protein bound-NiNTA resin was 
pelleted by centrifugation at 2000 g for 2 min at 4 °C. The resin was washed with 20 resin 

35 volumes of 500 mM KPi, pH 7.4, 20% glycerol, 1 0 mM mercaptoethanol, 1 0 mM imidazole, 
1:1000 dilution of protease inhibitor cocktail, 0.3%(v/v) IGEPAL CA630 and the resin pelleted 
by centrifugation at 2000 xg for 2 min at 4 °C. The resin was then washed with 10 resin volumes 
of 500 mM KPi, pH 7.4, 20% glycerol, 10 mM mercaptoethanol, 20 mM imidazole, 0.1% (v/v) 
protease inhibitors, 0.3% IGEPAL CA630 and the resin recovered by centrifugation as 

40 described above. The washing step was repeated as described above with buffer containing 50 
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mM imidazole. The resin was packed into a column at 4 °C and the cytochrome P450 eluted 
with 500 mM KPi, pH 7.4, 20 % glycerol, 10 mM mercaptoethanol, 300 mM imidazole, 0.1% 
(v/v) of protease inhibitor cocktail, 0.3%(v/v) IGEPAL CA630. 

5 (b) An alternative method for protein purification is as follows: 

The cells were pelleted at 10000 g for 10 min and resuspended in a buffer containing 500 mM 
KPi, pH 7.4, 20 % glycerol, 10 mM mercaptoethanol, 0.1% (v/v) of protease inhibitor cocktail 
(Calbiochem), 0.01 mg/ml DNase 1 and 5 mM MgS0 4 . 

1 0 The cells were lysed by passing twice through a Constant Systems Cell Homogeniser at 1 2000 
psi. The cell debris was then removed by centrifiigation at 70000 g at 4°C for 30 min. 

Detergent IGEPAL CA630 (Sigma) was added dropwise from a 10% stock solution to the lysate 
at a final concentration of 0.3% (v/v) and the lysate was incubated with previously washed 

1 5 NiNTA resin (Qiagen) overnight at 4°C, using agitation. The NiNTA resin was pelleted by 
centrifiigation at 2000 g for 2 min at 4 °C and washed, as described above, with 20 resin 
volumes of 500 mM KPi, pH 7.4, 20% glycerol, 10 mM mercaptoethanol, 50 mM glycine 0.1% 
(v/v) protease inhibitors, 0.3% IGEPAL CA630, followed by washing with 10 resin volumes of 
500 mM KPi, pH 7.4, 20% glycerol, 10 mM mercaptoethanol, 7.5 mM Histidine, 0.1% (v/v) 

20 protease inhibitors, 0.3% IGEPAL CA630. The resin was recovered by centrifiigation between 
washing steps and then the resin was packed into a column at 4 °C. The protein was eluted with 
500 mM KPi, pH 7.4, 20 % glycerol, 10 mM mercaptoethanol, 100 mM histidine, 0.1% (v/v) of 
protease inhibitor cocktail, 0.3%(v/v) IGEPAL CA630. 

25 The cytochrome P450 obtained from the NiNTA column by either elution protocol was quickly 
desalted (<10 min) into 10 mM KPi, pH 7.4, 20% glycerol, 0.2 mM DTT, 1 mM EDTA using a 
HiPrep 26/10 desalting column (Pharmacia), at a flow rate of 5 ml/min and collecting 16 ml 
fractions. The desalted cytochrome P450 was directly applied to a CM Sepharose column 
(Pharmacia), previously equilibrated with 10 mM KPi, pH 7.4, 20% glycerol, 0.2 mM DTT, 1 

30 mM EDTA. The following step elution was applied: wash with 1 0 column volumes of 10 mM 
KPi, pH 7.4, 20% glycerol, 0.2 mM DTT, 1 mM EDTA, wash with the above buffer with 75 
mM KC1 in order to remove any trace of detergent, then eluted with the above buffer with KC1 
concentration increased to 500 mM. The protein was concentrated up to 40 mg/ml using a 
microconcentrator for crystallization assays. 

35 

At this stage, the protein can be optionally further purified by running a gel filtration column. 
The concentrated P450 sample was applied on the top of a Superose 6 HR 10/30 gel filtration 
column (Pharmacia) and eluted at 0.2 ml/min with buffer containing 100 mM KPi, pH 7.4, 300 
mM KC1, 20% glycerol, 0.2 mM DTT. The protein was collected and concentrated up to 40 
40 mg/ml, as described above, for crystallization and quality assays. 
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Example 3: Quality Assays, 

The quality of the final preparation of proteins from Example 2 was evaluated by: 

5 (a) SDS polyacrylamide gel electrophoresis. 

This was performed using commercial gels (Nugen) followed by CBB (coomassie brilliant blue) 
staining according to the manufacturer's instructions. The purity as estimated by scanning a 
digital image of a gel was estimated to be at least 95%. 

10 . (b)Gel filtration chromatography. 

This was done using a Superose 6 HR10-30 column (Pharmacia) was performed to assess the 
aggregation state. The fractionation range for this column is 5xl0 3 to 5xl0 6 Da and is thus well 
adapted to the resolution of large complexes. The column was eluted at 0.2 ml/min with buffer 
containing 100 mM KPi, pH 7.4, 300 mM KC1, 20% glycerol, 0.2 mM DTT, 1 mM EDTA. 0.2 

1 5 ml protein samples at a concentration of approximately 40 mg/ml were used. Absorbance at 280 
nm was monitored arid the peak was collected and analysed using dynamic light scattering. 

(c)Light scattering. 

Samples (0.15 ml) collected after the CM Sepharose column and/or the gel titration column 
20 were analysed by DLS in fluorimeter quartz cells at 90° using laser radiation at 830.3 nm. Data 
was collected using a log correlator with variable expansion spanning a wide dynamic range. All 
measurements were performed at 20 °C with samples collected immediately from the gel 
filtration column. A run was on average 10 runs of 10 seconds each. To obtain an estimation of 
the molecular weight, we used a factional ratio of 1.26 and a partial specific volume of 0.726. 

25 

Samples prepared using our new method of purification possessed a good solubility and an 
absence of significant aggregation as shown by; 

- the ratio of far channel extrapolation and measured average scattering was always between 
30 0.999 and 1.003. 

- the average count rate did not vary significantly, with approximately 1% standard deviation. 

- analysis of the autocorrelation function using bi exponential fitting showed that 2C9 had an 
estimated Mr of approximately 180 KDa, i.e. it is an oligomer composed of no more than four 
subunits. 

35 - good stability of the samples (over 24h) at 20°C. 

Samples prepared by published protocols showed signs of a severe aggregation: 

- large fluctuations of the scattered light intensity, with a standard deviation of more than 10%. 
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- analysis of the autocorrelation function showed very slow exponential decay an indication of 
the presence of large aggregates (Mr > 10 6 Da), composed of a large number of P450 subunits. , 
These samples also show a high degree of polydispersity. 

- samples also showed further aggregation as a function of time. 

5 

These signs of severe aggregation in samples prepared by published methods were still present 
after sample filtration through 20 nm diameter pores or centrifugation at 200,000g for 30 min. 

(d) Mass Spectroscopy 

10 Mass spectroscopy was performed on a single quadrupole mass spectrometer (platformll, 

Micromass UK Ltd.). Samples (25 nl of purified protein at 25-60 mg/ml) were dialyzed against 
0.1 M ammonium acetate at 4 °C for 4 hours, using microcell dialyser (Pierce). The samples 
were diluted by a factor of 1 00 in 1 : 1 v/v methanols. 1 % aqueous formic acid and were then 
infused into the ionisation source of the mass spectrometer with a flow rate of 20 nl/min. 

15 

The mass spectrometer was fitted with a standard electrospray ionisation source. Positive 
electrospray ionisation was affected with a probe tip voltage of 3.5 kV, and a counter electrode 
voltage of 0.5 kV. Nitrogen was employed as both the nebulising and the drying gas, with a 
nebulising gas flow rate of 20 L/hr and a drying flow rate of 200L/hr. The sampling cone 
20 voltage was maintained at 40V . Data were acquired over the appropriate m/z range and were 
subsequently processed by mamial identification of the components wherever possible, followed 
by transposition onto a true molecular mass scale for more facile identification using Maximum 
Entropy processing techniques. The mass accuracy obtained for the analysed protein was 0.01% 
of the mass. 

25 

(e) Functionality assays 

Activity assays on P450 2C9 were performed in a 96-well plate assay format with a Fluoroscan 
Ascent FL Instruments (Labsystem), using the methoxy-4-(trifluoromethyl)-coumarin as a 
fluorescent substrate. 

30 

. Fifteen pmoles of P450 were reconstituted with 0.1 unit of purified human oxidoreductase, in 
presence of 140 \M of substrate methoxy-4-(trifluoromethyl)-coumarin, aNADPH regenerating 
system that includes 0.15 mM NADP + , 0.38 mM Glucose-6-phosphate and 2.9 unit/ml glucose- 
6-phosphate dehydrogenase in 170 jil final volume of 25 mM KPi, pH 7.4, 0.38 mM MgCl 2 . 
35 Incubations were performed at 37°C for several minutes and 7-hydroxy -4-(trifluoromethyl)- 
coumarin was used as metabolite standard to determinate the metabolic rate. The excitation and 
emission wavelengths used were respectively 409 and 530nm. 



WO 03/035693 PCT/GB02/04872 

55 

Example 4: Crystallisation conditions for 2C9-FGloop. 

Crystallization of P450 2C9-FGloop was achieved at 10-60 mg/ml protein in 10 mM Potassium 
phosphate, pH 7.4; 0.5 M KC1; 0.2 mM DTT; 1.0 mM EDTA; 20% glycerol against the 
conditions listed below. Crystals grew over a two week period in the morphologies indicated. 

5 

Appearance: Needles and rods 

Cell dimensions: a=161 A, b= 161 A, c=l 10 A, a=90°, p=90°, y=120°. 
Space Group: P321 

10 0.2 M Sodium Fluoride, 20% PEG 3350 

0.2 M Potassium Fluoride, 20% PEG 3350 

0.2 M Ammonium Fluoride, 20% PEG 3350 

0.2 M Lithium Chloride, 20% PEG 3350 

0.2 M Magnesium Chloride, 20% PEG 3350 
15 0.2 M Sodium Chloride, 20% PEG 3350 

2.0 M Sodium Chloride, 1 0% PEG 6K 

0.2 M Calcium Chloride, 20% PEG 3350 

0.2 M Potassium Chloride, 20% PEG 3350 

0.2 M Ammonium Chloride, 20% PEG 3350 
20 0.2 M Lithium Nitrate, 20% PEG 3350 

0.2 M Magnesium Nitrate, 20% PEG 3350 

0.2 M Sodium Nitrate, 20% PEG 3350 . 

0.2 M Potassium Nitrate, 20% PEG 3350 

0.2 M Ammonium Nitrate, 20% PEG 3350 
25 0.2 M Magnesium Formate, 20% PEG 3350 

0.2 M Sodium Formate, 20% PEG 3350 

0.2 M Potassium Formate, 20% PEG 3350 

0.2 M Ammonium Formate, 20% PEG 3350 

0.2 M Lithium Acetate, 20% PEG 3350 
30 0.2 M Magnesium Acetate, 20% PEG 3 350 

0.2 M Sodium Acetate, 20% PEG 3350 

0.2 M Sodium Acetate pH 4.6, 10-20% PEG 4000 

0.2 M Calcium Acetate, 20% PEG 3350 

0.2 M Potassium Acetate, 20% PEG 3350 
35 0.2 M Ammonium Acetate, 20% PEG 3350 

0.2 M Ammonium Acetate pH 4.6, 10-20% PEG 4000 

0.2 M Sodium Sulfate, 20% PEG 3350 

0.2 M Magnesium Sulfate, 20% PEG 3350 

0.2 M Potassium Sulfate, 20% PEG 3350 
40 0.2 M Ammonium Sulfate, 20% PEG 3350 
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0.2 M di-Sodium Tartrate, 20% PEG 3350 

0.2 M Potassium Sodium Tartrate, 20% PEG 3350 

0.2 M di-Ammonium Tartrate, 20% PEG 3350 

0.2 M Sodium dihydrogen Phosphate, 20% PEG 3350 
5 0.2 M di-Sodium hydrogen phosphate dihydrate, 20% PEG 3350 

0.2 M Potassium dihydrogen Phosphate, 20% PEG 3350 

0.2 M di-Potassium hydrogen Phosphate, 20% PEG 3350 

0.2 M Ammonium dihydrogen Phosphate, 20% PEG 3350 

0.2 M di-Ammonium hydrogen Phosphate, 20% PEG 3350 
10 0.2 M tri-Lithium Citrate, 20% PEG 3350 

0.2 M tri-Sodium Citrate, 20% PEG 3350 

0.2 M tri-Potassium Citrate, 20% PEG 3350 

15% PEG 1500 

30% PEG 1500 
15 0. 1 M MES pH 6.0, 5-20% PEG 6000 

0.1 M MES pH 6.5, 12% PEG 20,000 

0. 1 M Citric acid pH 5.0, 1 0% PEG 6000 

0.1 M Sodium Cacodylate, pH 6.6, 10-25% PEG 1500 

0.1 M Sodium Cacodylate, pH 6.4-6.8, 0.05-0.2 M Magnesium acetate, 10-20% PEG 8000 
20 0.05-0. 1 M Potassium dihydrogen phosphate, 1 0-20% PEG 8000 
0.2 M Potassium dihydrogen phosphate, 20% PEG 3000 

0-0.2 M Sodium Chloride, 0.1 M Potassium dihydrogen phosphate/di-Sodium hydrogen 
phosphate pH 5.8-6.6, 5-20% PEG 8000 

0-0.2 M Sodium Chloride, 0.1 M Potassium dihydrogen phosphate/di-Sodium hydrogen 
25 phosphate pH 5.8-6.6, 20% PEG 1000 

0-0.2 M Sodium Chloride, 0.1 M Potassium dihydrogen phosphate/di-Sodium hydrogen 
phosphate pH 5.8-6.6, 20% PEG 3350- 

0-0.2 M Sodium Chloride, 0.1 M Potassium dihydrogen phosphate/di-Sodium hydrogen 

phosphate pH 5.8-6.6, 15-20% PEG 5000MME 
30 0.5 M Ammonium Sulfate, 0.1 M HEPES pH 7.5, 30% 2-Methyl-2,4-pentanediol 

0.01 M Nickel (II) Chloride, 0.1 M Tris pH 8.5, 20% PEG MME 2000 

0.05-0.2 M Calcium Acetate, 0.1 M Tris HC1 pH 7.0-7.6, 10-22.5% PEG 3000 

0.1 M phosphate-citrate, pH 4.2, 0.05 M Lithium sulphate, 20% PEG 1000 

0.025-0.25 M di-potassium hydrogen phosphate, pH 7.0-7.8 
35 0.2 M di-potassium hydrogen phosphate, pH 8.4, 1 7.5% PEG 3350 

0.2 M Ammonium iodide, 20% PEG 3350 

0.2 M di-Ammonium hydrogen citrate, 20% PEG 3350 

0.2 M Lithium sulphate, 20% PEG 3350 

0.05-0.2 M K2HP04, 10% PEG 4000 
40 0.05-0.2 M K2HP04, 6.25%-20% PEG 3350 
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0.2 MK2HP04, 3.75-25% PEG 3350 

0.2-0.35 M K2HP04, 20% PEG 3350 

0.1-0.15 M K2HP04, 10% MPEG 2000 

0.2MK2HPO4, 3.75-10% MPEG 2000 
5 0.5 MK2HP04, 10% MPEG 2000 

0. 1 -0. 1 5 M K2HP04 , 1 0% PEG 1 000 

0.2 M K2HP04, 3.75-10% PEG 1000 

0.5MK2HPO4,10%PEG1000 

0.1M Citrate-HCl pH 5.6, 20% PEG 3000 
10 0.1 MTris-HClpH 7.0, 20% MPEG 2000 

0.1 M HEPES pH 7.5, 0.2 M sodium chloride, 20% PEG 3000 

0.1 M Imidazole-HCl pH 8.0, 0.2 M Calcium acetate, 10% PEG 8000 

0. 1 M Imidazole-HCl pH 8.0, 10% Iso-Propanol 

0.1 M Imidazole-HCl pH 6.5, 0.5 M Sodium acetate 
15 0. 1 M Sodium cacodylate pH 6.6, 20% PEG 3350 

0.1 M Citrate-HCl pH 5.6, 10% PEG 4000, 10% Isopropanol 

0. 1 M Tris-HCl pH 7.0-7.6, 0.1 -0.2 M Calcium acetate, 1 5-20% PEG 3000 

0.1 M phosphate-citrate pH 4.2, 0.2 M Lithium sulphate, 10% 2-propanol 

0.1 M citrate pH 5.5, 0.2 M Lithium sulphate, 15% ethanol ; 
20 0.1 M HEPES pH 7.5, 0.2 M Magnesium chloride, 15% ethanol 

20% PEG 300, 10% Glycerol, 0.1 M Tris pH8.5, 5% PEG 8000 

Example 5: Crystallisation conditions for 2C9P220. 

Crystallization of P450 2C9P220 was achieved at 10-60 mg/ml protein in 10 mM Potassium 
25 phosphate, pH 7.4; 0.5 M KC1; 0.2 mM DTT; 1 .0 mM EDTA; 20% glycerol against the 

conditions listed below. Crystals grew over a two week period in the morphologies indicated. 

Appearance: Spherical clusters 

30 0.1 M Tris-HCl pH 8.5, 0.2 M sodium acetate, 1 5% PEG 4000 
0.1 M Tris-HCl pH 8.5, 4% PEG 8000 

0. 1 M Tri-Sodium Citrate Dihydrate pH 5.6, 10% Iso-PropanolPEG 4000 
0. 1M HEPES pH 7.5, 0.2 M sodium chloride, 20% PEG 3000 
0. 1 M Na/K phosphate pH 6.2, 10% PEG 3000 
35 0.1M Tris pH 7.0, 0.2 M calcium acetate, 20% PEG 3000 
0.1M Tris pH 8.5, 20% PEG 1000 

0. 1 M HEPES pH 7.5, 0.2 M sodium chloride, 30% PEG 400 
0.2M di-Sodium tartrate, 20% PEG 3350 
0.2 M di-Sodium hydrogen phosphate dihydrate, 20% PEG 3350 
40 0.2 M di-Potassium hydrogen phosphate, 20% PEG 3350 
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0.2 M tri-Lithium citrate, 20% PEG 3350 
0.2 M tri-Sodhta citrate, 20% PEp 3350 
0.2 M tri-Potassium citrate, 20% PEG 3350 
0. 1M Tris-HCl pH 7.0, 0.2 M Calcium acetate, 20% PEG 3000 
5 0.2 M K2 H P04, 15% PEG 3350 
0.2M K2 H P04, 15% PEG 3350 

0.1M Tris-HCl pH 7.2, 0.2 M Calcium acetate, 20% PEG 3000 
0.1M Tris-HCl pH 7.2, 0.2 M Calcium acetate, 15% PEG 3000 
0.2MK2HPO4, 17.5% PEG 3350 ' 
10 0.2 MK2HPO4,20%PEG 3350 
0.3M K2 H PO4,20%PEG 3350 
0.2 MK2HP04, 22.5% PEG 3350 
0.2 M K2 H P04, 25% PEG 3350 

0.1M Tris-HCl pH 7.6, 0.2 M Calcium acetate, 20% PEG 3000 
15 0.1 M Tris-HCl pH 7.6, 0.2 M Calcium acetate, 15% PEG 3000 
0.1 M Tri-Sodium Citrate Dihydrate pH 5.0, 5% PEG 4000 
0.1 M HEPES 7.0, 5% PEG 4000 
0.1 M Tris pH 8.0, 5%-15% PEG 4000 
0.1 M Bis-Tris Propane pH 9.0, 5%-10% PEG 4000 

20 

Example 6: Further production of 2C9-FGIoop. 

2C9-FGloop was prepared in, and recovered from, a bacterial expression system as described in 
Example 2(a) above, and subject to further analysis by mass spectroscopy and an activity assay. 

25 Mass Spectroscopy 

Mass spectroscopy was performed using a Bruker "BioTOF" electrospray time of flight 
instrument. Samples were either diluted by a factor of 1000 straight from storage buffer into 
methanol/water/formic acid (50:48:2 v/v/v), or subjected to reverse phase HPLC separation 
using a C4 column. Calibration was achieved using Bombesin and angiotensin I using the 2+ 

30 and 1+ charge state. Data were acquired between 200 and 2000m/z range and were subsequently 
processed using Bruker 1 s X-mass program. Mass accuracy was typically below 1 in 10 000. 

Mass spec of 2C9-FGloop: 53967 Da (observed) 

53963.72 Da (predicted) 

35 

Functionality assays 

Activity assays on P450 2C9 were performed in a 96-well plate assay format with a Fluoroscan 
Ascent FL Instruments (Labsystem), using the methoxy-4-(trifluoromethyl)-coumarin as a 
fluorescent substrate. 
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Fifteen pmoles of P450 were reconstituted with 0.1 unit of purified human oxidoreductase, in 
presence of 140 \M of substrate methoxy-4-(trifluoromethyl)-coumarin, a NADPH regenerating 
system that includes 0.15 mM NADP + , 0.38 mM Glucose-6-phosphate and 2.9 unit/ml glucose- 
6-phosphate dehydrogenase in 170 ^1 final volume of 25 mM KPi, pH 7.4, 0.38 mM MgCl 2 . 
5 Incubations were performed at 37°C for several minutes and 7-hydroxy-4-(trifluoromethyl)- 
coumarin was used as metabolite standard to determinate the metabolic rate. The excitation and 
emission wavelengths used were respectively 409 and 530nm. The activity of 2C9-FGloop was 
0.1 10 pmol/min/pmol P450 with 2C9 substrate. 

10 Example 7: Crystals of 2C9-FGloop. 

Crystals of the 2C9-FGloop were grown using the hanging drop vapour diffusion method. 
Protein from example 6 at 40mg/ml in lOmM Kpi pH 7.4, 0.5 M KC1, 2mM DTT, ImM EDTA, 
20% glycerol, was mixed in a 1 : 1 ratio, using 0.5 \il drops, with a reservoir solution. The 
crystals of 2C9-FGloop grew over a reservoir solution containing 0.1 M Tris-HCl, pH 8.8; 15% 

15 PEG 400; 5% PEG 8000; 10% glycerol. Crystals formed within 1-7 days at 25°C, and had 
morphologies of hexagonal needles and rods. In a first experiment, a first crystal ("1"), was 
found to have approximate cell dimensions of 161 A, 161 A, 1 10 A, 90°, 90°, 120°. In a second 
experiment, a second crystal ("2"), was found to have approximate cell dimensions of 164 A, 
164 A, 1 1 1 A, 90°, 90°, 120°. This illustrates a typical range of variation within the 5% 

20 variability mentioned above. 

The crystals were flash frozen in liquid nitrogen, using 80% reservoir solution, 10% PEG 400 
and 10% glycerol as a ciyoprotectant. 

25 Data was collected from a 2C9-FGloop crystal to 3.3 A resolution at beamline ID 14. 1 

(wavelength 0.933 A) at the European Synchrotron Radiation Source using a Quantum4 CCD 
detector from a single crystal at 100K. The crystals belong to spacegroup P321 . Crystal 1 was 
found to have cell dimensions 161.35 A, 161.35 A, 110.75 A, 90°, 90°, 120°; in the case of 
crystal 2 the dimensions were 163.95 A, 163.95 A, 1 1 1.06 A, 90°, 90°, 120° and the data were 

30 collected to 3.0 A for the crystal. 

Coordinates of Table 1 or 2 can be used to solve the structure of 2C9-FGloop by molecular 
replacement. 

35 Example 8: Crystallisation and structure analysis of 2C9-FGloop K206E. 

E. coli transformed with the 2C9-FGloop K206E vector described above were grown and 
described in Example 2. 
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Protein Purification 

The cells were pelleted at 10000 g for 10 min and resuspended in a buffer containing 500 mM 
KPi, pH 7.4, 20 % glycerol, 10 mM mercaptoethanol, 0.1% (v/v) of protease inhibitor cocktail 
(Calbiochem), 10 mM imidazole, 40U/ml DNase 1 and 5 mM MgSCV 

5 

The cells were lysed by passing twice through a Constant Systems Cell Homogeniser at 12000 
psi. The cell debris was then removed by centrifugation at 70000 g at 4°C for 30 min. 

Detergent IGEPAL CA630 (Sigma) was added dropwise from a 1 0% stock solution to the lysate 
10 at a final concentration of 0.3% (v/v) and the lysate was incubated with previously washed 
NiNTA resin (Qiagen) overnight at 4°C, using agitation. The protein bound-NiNTA resin was 
pelleted by centrifugation at 2000 g for 2 min at 4°C. The resin was washed with 20 resin 
volumes of 500 mM KPi, pH 7.4, 20% glycerol, 10 mM mercaptoethanol, 10 mM imidazole, 
1:1000 dilution of protease inhibitor cocktail, 0.3%(v/v) IGEPAL CA630 and the resin pelleted 
15 by centrifugation at 2000 xg for 2 min at 4°C. The resin was then washed with 10 resin volumes 
of 500 mM KPi, pH 7.4, 20% glycerol, 10 mM mercaptoethanol, 20 mM imidazole, 0.1% (v/v) 
protease inhibitors, 0.3% IGEPAL CA630 and the resin recovered by centrifugation as 
described above. 

20 The resin was packed into a column at 4°C and the cytochrome P450 eluted with 500 mM KPi, 
pH 7.4, 20 % glycerol, 10 mM mercaptoethanol, 300 mM imidazole, 0.1% (v/v) of protease 
inhibitor cocktail, 0.3%(v/v) IGEPAL CA630. 

The cytochrome P450 obtained from the NiNTA column by either elution protocol was quickly 
25 desalted into 1 0 mM KPi, pH 7.4, 20% glycerol, 2.0 mM DTT, 1 mM EDTA using a HiPrep 
26/10 desalting column (Pharmacia), at a flow rate of 5 ml/min and collecting 17 ml fractions. 

The desalted cytochrome P450 was directly applied to a CM Sepharose column (Pharmacia), 
previously equilibrated with 10 mM KPi, pH 7.4, 20% glycerol, 2.0 mM DTT, 1 mM EDTA. 
30 The following step elution was applied: wash with 10 column volumes of 10 mM KPi, pH 7.4, 
20% glycerol, 2.0 mM DTT, 1 mM EDTA, wash with the above buffer with 75 mM KC1 in 
order to remove any trace of detergent, then eluted with the above buffer with KC1 concentration 
increased to 500 mM. 

35 The protein was concentrated up to 40 mg/ml using a microconcentrator for crystallization 
assays. To characterize the protein, the quality of the final preparation was evaluated by: 
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(a) SDS polyacrylamide gel electrophoresis 

This was performed using commercial gels (Nugen) followed by CBB staining according to the 
manufacturer's instructions. The purity as estimated by scanning a digital image of a gel was 
estimated to be at least 95%. 

5 

(b) Mass Spectroscopy 

Mass spectroscopy was performed using a Bruker "BioTOF" electrospray time of flight 
instrument. Samples were either diluted by a factor of 1000 straight from storage buffer into 
methanol/water/formic acid (50:48:2 v/v/v), or subjected to reverse phase HPLC separation 
1 0 using a C4 column. Calibration was achieved using Bombesin and angiotensin I using the 2+ 
and 1+ charge state. Data were acquired between 200 and 2000m/z range and were subsequently 
processed using Broker's X-mass program. Mass accuracy was typically below 1 in 10 000. 

Mass spec of 2C9-FGloop-K206E: 53966 Da (observed) 
15 53964.67 Da (predicted) 

(c) Functionality assays 

Activity assays on P450 2C9 were performed in a 96-well plate assay format with a Fluoroscan 
Ascent FL Instruments (Labsystem), using the methoxy-4-(trifluoromethyl)-coumarin as a 
20 fluorescent substrate. 

Fifteen pmoles of P450 were reconstituted with 0.1 unit of purified human oxidoreductase, in 
presence of 140 nM of substrate methoxy-4-(trifluoromethyl)-coumarin, a NADPH regenerating 
system that includes 0.15 mM NADP + , 0.38 mM Glucose-6-phosphate and 2.9 unit/ml glucose- 
25 6-phosphate dehydrogenase in 170 ^il final volume of 25 mM KPi, pH 7.4, 0.38 mM MgCh. 
Incubations were performed at 37°C for several minutes and 7-hydroxy -4-(trifluoromethyI)- 
coumarin was used as metabolite standard to determinate the metabolic rate. The excitation and 
emission wavelengths used were respectively 409 and 530 nm. The activity of the 2C9-FGloop- 
K206E was 0.083 pmol/min/pmol P450 with 2C9 substrate. 

30 

Crystallization of2C9-FGLoop-K206E 

Crystals of the 2C9-FGloop~K206E were grown using the hanging drop vapour diffusion 
method. Protein at 40mg/ml in 1 OmM Kpi pH 7.4, 0.5 M KC1, 2mM DTT, 1 mM EDTA, 20% 
glycerol, was mixed in a 1 : 1 ratio, using 0.5 jil drops, with a reservoir solution. The crystals of 

35 2C9-FGloop-K206E grew over a reservoir solution containing 0.2 M dibasic potassium 

phosphate and 20% PEG 3350 (Alternative conditions were also used, which were 0.1 M Tris- 
HC1, pH 8.5; 0.2 M LiS04; 15% PEG 4000). Crystals formed within 1-7 days at 25°C, and had 
morphologies of hexagonal needles and rods. The approximate cell dimensions of the crystals 
were 165 A, 165 A, 1 12 A, 90°, 90°, 120°. The crystals were flash frozen in liquid nitrogen, 

40 . using 80% reservoir solution, 10% PEG 400 and 10% glycerol as a cryoprotectant. 
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Example 9: Structure of 2C9-FGloop K206E. 

Data was collected from a 2C9-FGloop-K206E crystal (prepared as described in Example 8) to 
3.0 A resolution at beamline ED14.1 (wavelength 0.933 A) at the European Synchrotron 
5 Radiation Source using a Quantum4 CCD detector from a single crystal at 1 00K. A total of 90 
one degree oscillation images were collected and processed using MOSFLM 6.1 1 (Leslie, A. G. 
W. (1992). Jnt CCP4/ESF-EA CMB Newslett Protein Crystallogr. 26), scaled using SCALA 
4.1, and reduced using the CCP4 suite of programs (Collaborative Computational Project, 
Number 4, (1994). The CCP4 suite: programs for protein crystallography. Acta CrysL D50, 
10 760-763). 



Table of data statistics 



Resolution 


15-3.0 A 


3.16-3.0 A 


Completeness (%) 


99.4 


98.7 


Multiplicity 


52 


4.8. 


l/Sigma(I) 


3.5 


1.3 


Rmerge (%) 


12.7 


54.2 



The crystals belong to spacegroup P321 and have cell dimensions 165.46 A, 165.46 A, 
15 1 1 1 .70 A, 90°, 90°, 1 20°! There are two copies in the asymmetric unit, and the crystals have a 
solvent content of 68%. The structure was solved by molecular replacement using the 2C5 
structure (pdbid 1DT6) (Williams, P A; Cosme, J; Sridhar, V; Johnson, E F; McRee, D E, 
Molecular Cell, Volume 5, Issue J , January 2000,Pages 121-131) and the program AMORE 
(Navaza, J. (1994). AMoRe: an automated package for molecular replacement. Acta CrysL A50, 
20 157-163), giving a correlation coefficient of 67.8% and an R-factor of 38.9%. The coordinates 
of the structure are set out in Table 1 . The two copies in the asymmetric unit are related by a 
rotation of 145° about the Z-axis. The initial maps (both averaged and unaveraged) were 
relatively clean, and containing unmistakable electron density for the heme group which was 
omitted from the search model. This solution was using as a starting point for refinement using 
25 the program CNX (ibid). 

Example 10: Further crystallisation of2C9-FGloop K206E. 

Bacteria Expression 

A single ampicillin resistant colony of XL1 blue cells transformed with the 2C9-FGloop 
30 K206E-expressing plasmid described above was grown overnight at 37°C in Terrific Broth (TB) 
with shaking to near saturation and used to inoculate fresh TB media. Bacteria were grown to 
an OD600nm = 0.4 in 1 litre of TB broth containing 100 ng/ml of ampicillin at 37°C at 185 rpm 
in 2 litre flask. The heme precursor delta aminolevulinic acid (80 mg/1) was added 30 min prior 
to induction with 1 mM isopropyl-P-D-thiogalactopyranoside (IPTG) and the temperature 
35 lowered to 25°C. The bacterial culture was continued under agitation at 25°C for 72 hours. 
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Protein Purification 

The cells were pelleted at 10000 g for 10 min and resuspended in a buffer containing 500 mM 
KPi, pH 7.4, 20% glycerol, 10 mM mercaptoethanol, 0.1% (v/v) of protease inhibitor cocktail 
(Calbiochem), 10 mM imidazole, 40 U/ml DNase 1 and 5 mM MgS0 4 . 

5 

The cells were lysed by passing twice through a Constant Systems Cell Homogeniser at 1 0000 
psi. The cell debris was then removed by centrifugation at 22000 x g at 4°C for 30 min. 

Detergent IGEPAL CA630 (Sigma) was added dropwise from a 10% stock solution to the lysate 
10 at a final concentration of 0.3% (v/v) and the lysate was incubated with previously washed 
NiNTA resin (Qiagen) overnight at 4°C, using agitation. The protein bound-NiNTA resin was 
pelleted by centrifugation at 2000 g for 2 min at 4°C. The resin was washed with 30 resin 
volumes of 500 mM KPi, pH 7.4, 20% glycerol, 10 mM mercaptoethanol, TO mM imidazole, 
1:1000 dilution of protease inhibitor cocktail, 0.3%(v/v) IGEPAL CA630 and the resin pelleted 
15 by centrifugation at 2000 xg for 2 min at 4°C. The resin was then washed with 1 5 resin volumes 
of 500 mM KPi, pH 7.4, 20% glycerol, 10 mM mercaptoethanol, 20 mM imidazole, 0.1% (v/v) 
protease inhibitors, 0.3% IGEPAL CA630 and the resin recovered by centrifugation as 
described above. 

20 The resin was packed into a column at 4°C and the cytochrome P450 eluted with 500 mM KPi, 
pH 7.4, 20 % glycerol, 10 mM mercaptoethanol, 300 mM imidazole, 0.1% (v/v) of protease 
inhibitor cocktail, 0.3%(v/v) IGEPAL CA630. 

The cytochrome P450 obtained from the NiNTA column was quickly desalted into 10 mM KPi, 
25 pH 7.4, 20% glycerol, 2.0 mM DTT, 1 mM EDTA using a HiPrep 26/1 0 desalting column 
(Pharmacia), at a flow rate of 5 ml/min. 

The desalted cytochrome P450 was directly applied to a CM Sepharose column (Pharmacia), 
previously equilibrated with 10 mM KPi, pH 7.0, 20% glycerol, 2.0 mM DTT, 1 mM EDTA. 
30 The following step elution was applied: wash with 20 column volumes of 1 0 mM KPi, pH 7.0, 
20% glycerol, 2.0 mM DTT, 1 mM EDTA, wash with the above buffer with 75 mM KC1 in 
order to remove any trace of detergent, then eluted with the above buffer with KC1 concentration 
increased to 500 mM. 

35 The protein was concentrated up to 40 mg/ml using a microconcentrator for crystallization 
assays. 

Crystallization of2C9-FGloop K206E 

Crystals of the 2C9-FGloop-K206E were grown using the hanging drop vapour diffusion 
40 method. Protein at 40 mg/ml in lOmM Kpi pH 7.0, 0.5 M KC1, 2mM DTT, ImM EDTA, 20% 
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glycerol, was mixed in a 1:1 ratio, using 0.5 p.1 drops, with a reservoir solution. The crystals of 
2C9-FGloop-K206E were grown over a reservoir solution containing: 0.1 M Tris-HCl pH 8.4, 
1 5% PEG 400, 5% PEG 8000, 1 0% glycerol. 

5 Rod shaped crystals formed within 1 day at 25°C. The crystals were flash frozen in liquid 
nitrogen, using the reservoir solution as a cryoprotectant The approximate cell dimensions of 
the crystals were 164.9 A, 164.9 A, 1 1 1.1 A, a = 90°, p = 90°, y = 120°. 

Example 11: Production of a 2.6 A resolution structure of 2C9-FGloop K206E 

1 0 Data was collected to 2.6 A resolution from a crystal of 2C9-FGloop-K206E crystal (prepared as 
described in Example 4) at beam line 14.1 at the European Synchrotron Radiation Facility, using 
a Quantum4 CCD detector from a single crystal at 100 K. The crystal was grown against a 
reservoir solution of 0.1M Tris pH 8.4, 15% PEG 400, 5% PEG 8000, 10% Glycerol, and was 
frozen directly from the reservoir solution. A total of 50 images were collected and processed 

15 using MOSFLM (Leslie, A. G. W. (1992). Jnt CCP4/ESF-EACMB Newslett. Protein 
Crystallogr. 26), scaled using SCALA and reduced using the CCP4 suite of programs 
(Collaborative Computational Project, Number 4, (1994). The CCP4 suite: programs for protein 
crystallography. Acta CrysL D50, 760-763). 

20 Table of data statistics __ 



Resolution 


50-2.6 A 


2.74-2.60 A 


Completeness 


96.5% 


84.3% 


Multiplicity 


2.6 


2.0 


1/ Sigma I 


6.8 


1.2 


R merge 


8.7 


57.0 



This data was used in refinement, using the model generated by the refinement against the initial 
3,0 A data, to generate the coordinates of Table 2. A consistent set of 5% of the reflections was 
flagged for Free R calculation, and extended to the higher resolution. The refinement was 
25 continued using the programs CNX (Brunger et ah, Current Opinion in Structural Biology, Vol. 
8, Issue 5, October 1998, 606-61 1, and commercially available from Accelerys, San Diego, CA) 
and REFMAC (Collaborative Computational Project, Number 4, (1994). The CCP4 suite: 
programs for protein crystallography. Acta CrysL D50, 760-763), to an R factor of 21.9% and 
an R free factor 25.0%. 

30 

Example 12: Structure of 2C9-FGloop. 

Data were collected from a 2C9-FGloop crystal 2 (prepared as described in Example 7) to 3.1 A 
resolution at beamline ID 14.1 (wavelength 0.933 A) at the European Synchrotron Radiation 
Source using a Quantum4 CCD detector from a single crystal, frozen directly from the 
35 crystallisation solution (0.1 M Tris-HCl pH 8.8, 15% PEG 400, 5% PEG 8000, 10% glycerol) at 
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100K. The crystal belong to space group P321. Crystal 2 was found to have cell dimensions 
163.95 A, 163.95 A, 1 1 1.06 A, 90°, 90°, 120°. 

A total of 1 00 degrees of data were collected, processing using MOSFLM, scaled using SC ALA 
5 and reduced further using the CCP4 suite of programs. The structure of 2C9-FGloop was solved 
by molecular replacement using the program AMORE and the 2.6 A 2C9-FGloop-K206E 
structure (Table 2) as a search model. The structure was refined using strict non- 
crystallographic symmetry using the program CNX to generate the coordinates of Table 3. The 
final structure has an R factor of 26.8% and a Free R factor of 29.8% for all data between 30 and 
10 3.1 A. 



Table of data statistics: 



Resolution 


R merge . 


Completeness 


Mult 


I/Sigl 


9.80 


0.041 


96.8 


3.0 


15.0 


6.93 


0.056 


99.8 


3.1 


8.0 


5.66 


0.101 


99.8 


3.1 


6.5 


4.90 


0.113 


99.8 


3.1 


5.3 


4.38 


0.117 


98.4 


2.8 


5.4 


4.00 


0.118 


92.1 


2.4 


5.2 


3.71 


0.141 


80.4 


2.0 


4.1 


3.47 


0.183 


71.7 


1.8 


1.4 


3.27 


0.242 


65.1 


1.7 


2.4 


3.10 


0.374 


58.7 


1.7 


1.7 


Overall 


0.099 


81.9 


2.4 


5.0 



Example 13: Identification and use of P450 binding pocket residues. 

1 5 The crystal structure for 2C9 has for the first time allowed the precise identification of all the 
residues that line the binding site of the enzyme (Table 4). Some residues proposed to be in the 
catalytic site by a variety of sources can now be shown not to be binding pocket residues but 
residues that hold the catalytic residues in place. 

20 Table 4: All residues lining the 2C9 binding pocket 



ARG 


97 


GLY 


98 


ILE 


99 


PHE 


100 


LEU 


102 


ALA 


103 


ALA 


106 


ASN 


107 


GLY 


109 


PHE 


110 


GLY 


111 


ILE 


112 


VAL 


113 


PHE 


114 


THR 


167 


PHE 


168 


ILE 


178 


CYS 


179 


ILE 


181 


ILE 


182 


MET 


198 


LEU 


201 


ASN 


202 


ASN 


204 


1LE 


205 


LEU 


208 


SER 


209 


SER 


210 


PRO 


211 


ILE 


213 


GLN 


214 


ASN 


217 


LEU 


233 


VAL 


237 


MET 


240 


LYS 


241 


ASN 


289 


VAL 


292 


ASP 


293 


LEU 


294 


PHE 


295 


GLY 


296 


ALA 


297 


GLY 


298 


THR 


299 


GLU 


300 


THR 


301 


THR 


302 


SER 


303 


THR 


304 


THR 


305 


ARG 


307 


ASP 


360 


LEU 


361 


LEU 


362 


PRO 


363 


THR 


364 


SER 


365 


LEU 


366 


PRO 


367 


ASN 


474 


GLY 


475 


PHE 


476 


ALA 


477 


SER 


478 


VAL 


479 



Residues previously inferred to be in the binding site of 2C9 from modelling (e.g. homology 
modelling, SRS proposals, 3D/4D-QSAR, sequence alignments, or mutagenesis studies) which 
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with the aid of the crystal structure are now known to line the 2C9 binding pocket are in Table 

5. 



Table 5: Residues previously inferred to be in the binding site of 2C9 



ARG 


97 


GLY 


98 


ILE 


99 


PHE 


100 


LEU 


102 


ALA 


103 


ALA 


106 


ASN 


107 


GLY 


109 


PHE 


no 


GLY 


111 


ILE 


112 


VAL 


113 


PHE 


114 


LEU 


201 


ASN 


202 


ASN 


204 


ILE 


205 


LEU 


208 


SER 


209 


SER 


210 


GLN 


214 


LEU 


233 


VAL 


237 


MET 


240 


LYS 


241 


ASN 


289 


VAL 


292 


ASP 


293 


LEU 


294 


PHE 


295 


GLY 


296 


ALA 


297 


GLY 


298 


THR 


299 


GLU 


300 


THR 


301 


THR 


302 


SER 


303 


THR 


304 


THR 


305 


ARG 


307 


ASP 


360 


LEU 


361 


LEU 


362 


PRO 


363 


THR 


364 


SER 


365 


LEU 


366 


PRO 


367 


ASN 


474 


GLY 


475 


PHE 


476 


ALA 


477 


SER 


478 













5 

Some residues found in the binding pocket have never before been identified as binding site 
residues. These are listed in Table 6. The identification of these will greatly facilitate the 
modelling of compound binding. 

10 Table 6: Residues newly identified as lining the 2C9 binding pocket 



THR 


167 


PHE 


168 


ILE 


178 


CYS 


179 


ILE 


181 


ILE 


. 182 


MET 


198 


PRO 


211 


ILE 


213 


ASN 


217 


VAL 


479 





Accordingly, in a preferred aspect of the invention, the selected coordinates used in a method of 
the invention will comprise at least one coordinate, preferably at least one side-chain coordinate 
of an amino acid residue selected from either Table 5 or 6. '; 

15 

Preferably, the selected coordinates include the coordinates of all the atoms of Table 1, 2, 3 or 8 
relating to at least one amino acid from Table 5 or 6. 

Also preferred, whether all or just some atoms of a particular amino acid are selected, is that at 
20 least 2, more preferably at least 5, and most preferably at least 10 of the selected coordinates are 
of side chain residues from the corresponding number of different amino acid residues. These 
may be selected exclusively from either of Table 5 or 6, or a combination thereof. Preferably at 
least one side chain residue coordinate of Table 6 is included. 

25 Example 14: Modelling other P450 structures. 

Some of the residues in Tables 5 and 6 are residues which do not occur at the sequence positions 
indicated in the Tables in a naturally occurring human 2C9 or are residues which differ in other 
human P450 structures. For these residues in particular, molecular modelling techniques 
(including but not limited to molecular replacement or computer assisted semi-manual methods) 
30 may be used to obtain a model in which a different residue is provided at such a location. For 
example, position 206 (recited in Table 5 above) in the protein 2C9-FGloop is lysine, which 
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comprises a positive charge. Using the X-ray diffraction data of the 2C9-FGloopK206E we 
have modelled the 2C9-FGloop protein to provide coordinate data for this protein. 

The coordinate data corresponds to Table 1 apart from the data for residue 206, which is as 
5 follows: 



ATOM 


1328 


N 


LYS 


A 


206 


-9.209 


86.411 


32. 


115 


1. 


00 


52.64 


A 


N 


ATOM 


1329 


CA 


LYS 


A 


206 


-8.030 


86.236 


32. 


948 


1. 


00 


53.62 


A 


C 


ATOM 


1330 


CB 


LYS 


A 


206 


-6.751 


86.197 


32. 


104 


1. 


00 


56.05 


A 


C 


ATOM 


1331 


CG 


LYS 


A 


206 


-6.295 


84.776 


31. 


762 


1. 


00 


60.03 


A 


C 


ATOM 


1332 


CD 


LYS 


A 


206 


-7.406 


83.981 


31. 


026 


1. 


00 


61.59 


A 


C 


ATOM 


1333 


CE 


LYS 


A 


206 


-7.093 


82.478 


30. 


921 


1. 


00 


62.61 


A 


C 


ATOM 


1334 


NZ 


LYS 


A 


206 


-6.906 


81.756 


32. 


235 


1. 


00 


63.34 


A 


N 


ATOM 


1335 


C 


LYS 


A 


206 


-7.966 


87.351 


33. 


963 


1. 


00 


52.66 


A 


C 


ATOM 


1336 


0 


LYS 


A 


206 


-7.663 


87.117 


35. 


125 


1. 


00 


52.36 


A 


O 



It will be observed that the CB carbon atom, i.e. the first carbon atom in the side-chain is in an 
almost identical position to the Glu CB carbon atom of Table 1, whereas the remaining atoms of 
the side chain (CD, CE & NZ) are in locations based upon a low-energy configuration of the 

10 lysine side chain, taking into* account the connection of the side chain to the CA carbon atom. It 
is thus relatively simple for a person skilled in the art of molecular modelling to arrive at a 
model for a P450 in which one or more residues of the 2C9-FGloopK206E is replaced in an 
analogous manner. The coordinate data for 2C9-FGloop are set out in Table 3. It will be 
appreciated by those of skill in the art that in the space group P321 it is possible to index 

1 5 diffraction data in one of two ways. The data can be converted from one indexing to the other 
using the operator k, h, -1. In the case of 2C9-FGloop K206E at 3.0 A (Table 1) the data were 
indexed differently compared to the data of 2C9-FGloop K206E at 2.6 A (Table 2) or 2C9- 
FGloop (Table 3), and hence while the crystal forms of the proteins are substantially identical, 
the crystal structures are not in the same absolute space. Hence the co-ordinate data for Glu206 

20 in Table 3 is not numerically equivalent to that shown above as this modelled in Table 1. Those 
of skill in the art will be able to convert the above data for residue 206 accordingly. 

Thus where in modelling the interaction of a compound or a metabolite thereof with a P450 
structure of the invention, it is found that the residue at position 206 may be involved in the 
25 interaction with that particular metabolite, the above data for residue 206 may be used in Table 1 
in place of the data for 206Glu. This is in view of the change of charge which results from the 
difference. For compounds or metabolites which are found to interact with other regions of 
P450, there may be no need to amend Table 1 in this manner. 

30 However, similar modelling may be performed for other parts of P450 where it is determined to 
be important that the potential interactions of a compound with the binding pocket in those parts 
of the protein is of particular interest. Thus the residues Pro220, Ala221, Leu222 and Leu223 in 
particular were remodelled in a similar manner to that discussed above in order to predict the 
coordinates of wild type residue side chains in a P450 structure. 
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The modelled coordinates of the 2C9 wild type protein are the same as those contained in Table 
1, 2, 3 or 8 except that the residues listed in Table 7 substitute for the corresponding residues of 
Table 1,2, 3 or 8. 

5 

Thus the present invention covers a structure of 2C9 for use in silico in which the coordinates 
are those of Table 1, 2, 3 or 8, except that the atoms and corresponding coordinates of one or 
more of residues 215, 216, 220, 221, 222, and 223 are substituted by the atoms and 
corresponding coordinates of the wild typed residues of Table 7. Thus to the extent that 
10 previous aspects of the invention relate to Table 1, 2, 3 or 8, they also relate to Table 1, 2, 3 or 8 
with the atoms and corresponding coordinates of one or more of residues 215, 216, 220, 221, 
222, and 223 substituted by those of the wild typed residues listed in Table 7. 

Example IS: Docking Experiment 

1 5 The crystal structure of 2C9 was used to computationally dock a drug molecule into the binding 
pocket. The drug diclofenac, a known substrate for human 2C9, was generated and placed into 
the 2C9 binding pocket using interactive computer graphics. The observed interactions can now 
be used to chemically modify diclofenac via a structure-based design ' strategy to mediate its 
interaction with human 2C9 and improve its therapeutic potential. 

20 - \ "v ■ ■'■ 

Example 16: Refinement of 2C9-FGIoop K206E structurie, ' 

Data generated in Example 1 1 was further refined to generated Table 8 (Figure 5). A total of 
147 water molecules have been added (manually and automatically) and included in the 
refinement. This resulted in an Rfactor of 20.7% and a R free factor of 25.9%. 

25 

Example 17: Production of further 2C9 proteins. 

The nucleic acid encoding 2C9trunc, 2C9P220 (also called 1072), 2C9-FGloop (1015) and 2C9- 
FGloop K206E (1 155) were used to produce further 2C9-encoding nucleic acids using either 
cassette mutagenesis (CM) or site-directed mutagenesis (QC). Site-Directed Mutatgenesis (PCR 

30 mutagenesis) was performed using the Stratagene Quikchange™ mutagenesis kit (catalogue 
number #200518), according to manufacturers instructions. The Quikchange™ mutagenesis 
method generates a mutated plasmid with staggered nicks and uses Dpnl digestion to remove all 
parental DNA. Reactions were made incorporating 5.0 nL of 10X reaction buffer, 5-50 ng 
template plasmid DNA, 1.0 dNTP mix and 125 ng oligonucleotide primers. The primers and 

35 template used for each construct are as listed in the table below. 

Reactions were made to 50 \xL with sterile water, 2.5U Pfu Turbo was then added and the 
reaction overlayed with 30 jiL mineral oil. Thermocycling was then carried out as follows: 
95°C, 30 sec(l cycle), 95°C, 30 sec, 55°C, 1 min, 68°C 13.5 min (18 cycles) and finally a 
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holding period at 4°C. A control reaction was also included with water in place of 
oligonucleotide primers. Following thermocycling 10 U Dpnl was added, under the level of the 
mineral oil, to each reaction. The reactions were then gently mixed followed by centrifiigation in 
a bench top microcentrifuge, 1 min, 13,000 rpm and incubated at 37°C for 3 hr. 

5 

Digested product (1 ^L) was then used to transform 50 ^iL competent £ coli XLl-Blue cells 
(Stratagene). The whole transformation as then plated onto Luria agar plates containing 100 
[ig/ml carbenicillin, inverted, and incubated overnight at 37°C. Plasmid DNA was prepared from 
individual colonies and sequenced to check for the insertion of the correct mutation(s). 

10 

Cassette mutagenesis was performed on the 2C9 FG region (residues 215 to 226) utilising the 
BamHI and Xmal sites, two unique and natural restriction sites that are present in this region. 
Complementary oligonucleotides with the 5' BamHI and 3' Xmal overhang restriction sites 
were designed to introduce mutations in the FG region (Tables 9, 10 and 14). Double stranded 

1 5 oligonucleotides were prepared by heating 1 0 jig of a mixture of complementary 

Oligonucleotides at 100°C for 5 min in 100 \il of water and slow cooling at 25°C. Double 
stranded Oligonucleotides were ligated into purified plasmid pCW-2C9 wt opened by BamHI 
and Xmal restriction enzymes and an aliquot of the ligation was used to transform XI 1 Blue E. 
coli. , t> . ' 

20 \ '^.;v . ; 

2C9 proteins of the invention produced by the above methods are set out in Table 9, which also 
indicates the primers used. Crystals of all these proteins were obtained under a variety of 
conditions, shown in Table lT (see Example 20). 

25 As controls, 2C9 proteins without proline at 220 were made using the same techniques. The 
proteins made are shown in Table 10. Under a range of conditions tested, no protein crystals 
were recovered. 
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Fw 5 1 ccagatctgcaataattttccgcctatcattgattacttccc3 '(125) 
Rev 5 'ggtctagacgttattaaaaggcggatagtaactaatgaaggg3 • (126) 
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Fw 5'cctgatgaaaatggagcaggaaaagcacaaccaacc3* (137) 
Rev 5 *ggttggttgtgcttttcctgctccattttcatcagg3 * ( 1 3 8) 


F w 5 1 ga tgaaaatgg agaaggaagacgacaaccaaccatct gaattta c 3 '(139) 
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Fw 5'catcactttctggatgcaggtggcaattttaagaaaag3 , (147) 
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Mutations 


2C9 S220P 


2C9-FGloop + 
N466D 


2C9-FGloop + F482S 


2C9-FGloop + 
Q192E 


2C9-FGloop + 
Q193E 
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Cl 
O 
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2C9-FGloop + 
K273Q 


2C9-FGIoop + 
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2C9-FGloop + 
E415A 
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Example 18: Production of 2C9 proteins. 

Bacteria Expression. 

The 2C9 proteins of Example 17 were produced in a bacterial expression system. A single 
5 ampicillin resistant colony of XL1 blue cells was grown overnight at 37°C in Terrific Broth 
(TB) with shaking to near saturation and used to inoculate fresh TB media. Bacteria were 
grown to an OD600nm = 0.4 in 1 litre of TB broth containing 100 jig/ml of ampicillin at 37°C at 
185 rpm in 2 litre flask. The heme precursor delta aminolevulinic acid (80 mg/1) was added 30 
min prior to induction with 1 mM isopropyl-P-D-thiogalactopyranoside (IPTG) and the 
10 temperature lowered to 25°C. The bacterial culture was continued under agitation at 25°C for 72 
hours. 

Protein Purification 

The cells were pelleted at 10000 g for 10 min and resuspended in a buffer containing 500 mM 
15 KPi, pH 7.4, 20% glycerol, 10 mM mercaptoethanol, 0.1% (v/v) of protease inhibitor cocktail 
(Calbiochem), 10 mM imidazole, 40 U/ml DNase 1 and 5 mM MgSC>4. 

The cells were lysed by passing twice through a Constant Systems Cell Homogeniser at 10000 
psi. The cell debris was then removed by centrifugation at 22000 x g at 4°C for 30 min. 

•20 . J..; 

• Detergent IGEPAL CA630 (Sigma) was added dropwise from a 10% stock solution to the lysate 
•< at a final concentration of 0.3% (v/v) and the lysate was incubated with previously washed 
NiNTA resin (Qiagen) overnight at 4°C, using agitation. The protein bound-NiNTA resin was 
pelleted by centrifugation at 2000 g for 5 min at 4°C. The resin was washed with 30 resin 

25 volumes of 500 mM KPi, pH 7.4, 20% glycerol, 10 mM mercaptoethanol, 10 mM imidazole, 
1 : 1000 dilution of protease inhibitor cocktail, 0.3%(v/v) IGEPAL CA630 and the resin pelleted 
by centrifugation at 2000 xg for 5 min at 4°C. The resin was then washed with 15 resin volumes 
of 500 mM KPi, pH 7.4, 20% glycerol, 10 mM mercaptoethanol, 20 mM imidazole, 0.1% (v/v) 
protease inhibitors, 0.3% IGEPAL CA630 and the resin recovered by centrifugation as 

30 described above. 

The resin was packed into a column at 4°C and the cytochrome P450 eluted with 500 mM KPi, 
pH 7.4, 20 % glycerol, 10 mM mercaptoethanol, 300 mM imidazole, l:1000(v/v) of protease 
inhibitor cocktail, 0.3%(v/v) IGEPAL CA630. 

35 

The cytochrome P450 obtained from the NiNTA column was quickly desalted into 10 mM KPi, 
pH 7.4, 20% glycerol, 2.0 mM DTT, 1 mM EDTA using a HiPrep 26/10 desalting column 
(Pharmacia), at a flow rate of 5 ml/min. 
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The desalted cytochrome P450 was directly applied to a CM Sepharose column (Pharmacia), 
previously equilibrated with 10 mM KPi, pH 7.4, 20% glycerol, 2.0 mM DTT, 1 mM EDTA. 
The following step elution was applied: wash with 20 column volumes of 10 mM KPi, pH 7.4, 
20% glycerol, 2.0 mM DTT, 1 mM EDTA, wash with the above buffer with 75 mM KC1 in 
5 order to remove any trace of detergent, then eluted with the above buffer, with KC1 concentration 
increased to 500 mM. 

The protein was concentrated up to 40 mg/ml using a microconcentrator for crystallization 
assays. The quality of the final preparation was evaluated by: 

10 

(a) SDS polyacrylamide gel electrophoresis 

This was performed using commercial gels (Nugen) followed by CBB staining according to the 
manufacturer's instructions. The purity as estimated by scanning a digital image of a gel was 
estimated to be at least 95%. 

15 

(b) Mass Spectroscopy 

Mass spectrometry was performed using<a Bruker BioTOF II electrospray time of flight 
instrument. Samples were either diluted by a factor of 1000 straight from storage buffer into 
methanol/water/formic acid (50:48:2-v/v/v), or subjected to a reverse phase separation using a 

20 C4 Millipore 'zip-tip 1 or a C4 HPLC column, before being diluted into methanol/water/formic 
acid. Calibration was achieved by measurement of the 2+ and 1+ charge states of a peptide 
mixture containing Bombesin and angiotensin I or by using the multiple charge states of Horse 
Myoglobin. Data were acquired in the m/z range 200 to 2000 and were subsequently processed 
using Bruker' s X-mass program. Mass accuracy was expected to be better than 1 in 10 000 

25 (lOOppm). 

Predicted and observed mass spectrometry data for the proteins is listed in Table 12. 



Table 12. Mass Spectrometry Data for 2C9 proteins. 



Clone ID 


Calculated Mass (without 
N-met) (Da) 


Observed Mass (Da) 


Mass Diff (without N-met) 
(ppm) 


1015 


53964 


53966.65 


2.65 


1039 


53907.96 


53911.25 


3.29 1 


1072 


53948.02 


53950.23 


2.21 


1078 


53944.03 


53944 


-0.03 i 


1081 


53964.98 


53967 


2.02 


1082 


53903.9 


53909 


5.1 


1085 


53964.98 


53964.95 


-0.03 


1097 


53964.98 


53971 


6.02 


1100 


53963.06 


53971 


7.94 


1101 


53963.96 


53965 


1.04 


1102 


53928.86 


53930.37 


1.51 
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Clone ID 


Calculated Mass (without 
N-met) (Da) 


Observed Mass (Da) 


Mass Diff (without N-met) 
(ppm) 


1115 


53905.96 


53905 


-0.96 


1116 


53906.9 


53910 


3.1 


1117 


53906.9 


53909 


2.1 


1118 


53906.9 


53907 


0.1 J 


1121 


53906.9 


53910 


3.1 


1122 


53905.96 


53908 


2.04 


1123 


53792.72 


53795.16 


2.44 


1155 


53964.94 


53964.84 


-0.1 


1165 


53624.65 


53627 


2.35 


1220 


53933.99 


53936.03 


2.04 


1319 


53937.92 


53942.38 


4.46 


1339 


53948 


53953 


5 


1340 


53889.96 


53892 


2.04 


1361 


53978.03 


53978.83 


0.8 


1362 


53903.97 


53901.62 


-2.35 


1363 


53964 


53964.32 


0.32 j 


1364 


53964 


53966.32 


2.32 


1365 


53953.96 


53955.22 


1.26 


1366 


53918 


53920.06 


2.06 


1367 


53918 


53919.31 


1.31 


1368 


53918 


53900.75 


33.02 


1369 


53964 


53964.08 


0.08 ~ ] 


1370 


53903.97 


53907.52 


3.55 j 


1371 


53978.03 


53978.74 


0.71 


1372 


53901.93 


53901.83 


-0.1 


1391 


53987.04 


53987.61 


0.57 


1392 


53973.01 


53975.41 


2.4 


1394 


53973.01 


53973.65 


0.64 


1396 


53922.95 


53924.21 


1.26 


1397 


53918.06 


53914.83 


-3.23 1 


1423 


53857.9 


53858.94 


1.04 


1424 


53891.92 


53891.92 


0 


1443 


53930.93 


53932.6 


30.97 


1444 


53942.98 


53943.2 


0.22 


1475 


53987.98 


53988.71 


0.73 


1477 


53904.84 


53907.07 


2.23 


1491 


53922.86 


53923.9 


1.04 



Example 19: Activity of 2C9 Proteins of the invention. 

Activity assays on P450 2C9 were performed in a 96-well plate assay format with a Fluoroscan 
Ascent FL Instruments (Labsystem), using the 7-methoxy-4-(trifluoromethyl)-coumarin as a 
fluorescent substrate. 

Fifteen pmoles of purified P450 were reconstituted with 0.1 unit of purified human 
oxidoreductase, in presence of 137 nM of substrate 7-methoxy-4-(trifluoromethyl)-coumarin 
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and a NADPH regenerating system that includes 0.14 mM NADP*, 0.37 mM Glucoses- 
phosphate, 0.38 mM MgCh and 2.8 unit/ml glucose-6-phosphate dehydrogenase, in 180 jxl final 
volume of 25 mM KPi, pH 7.4. Incubations were performed at 37°C for 40 minutes and 37.5 
pmoles of metabolite standard 7-hydroxy -4-(trifluoromethyl)-coumarin were used to 
5 determinate the metabolic rate. The excitation and emission wavelengths used were respectively 
409 and 530 nm. The results for the clones tested are set out in Table 13. 



Table 13. Activity Data. 



Clone 


Activity 


1015(2C9-FGloop) 


0.26 


1072(2C9-P220) 


0.43 


1361 


0.68 


1362 


0.33 


1363 


0.27 


1364 


0.15 


1366 


0.27 


1367 


0.56 


1368 


0.52 


1369 


0.43 


1370 


, 0.69 


1371 


0.17 


1372 


0.20 ! • 



10 As a control, the activity of the prptein 2C9trunc (wild type) and 1 365 (which both have no 
proline at 220) was determined arid found to be 0.47 and 0.43, respectively. 

Example 20: Crystallisation of 2C9 proteins. 

Crystals of the 2C9 mutants were grown using the hanging drop vapour diffusion method. 
15 Protein at 10-60 mg/ml (usually 40 mg/ml) in lOmM Kpi pH 7.4, 0.5 M KC1, 2mM DTT, ImM 
EDTA, 20% glycerol, was mixed in a 1 : 1 ratio, using 0.5 yd drops, with a reservoir solution. A 
number of different 2C9 proteins of the invention formed crystals under the following reservoir 
solution conditions: 

0.05-0.1 M Tris-HClpH 8.0-8.8, 0.1-0.2 M Lithium sulphate, 10-15% PEG 4000; 
20 0.1 MTrispH 8.0-8.8, 15-30% PEG 400, 5% PEG 8000, 10% glycerol; and 
0.1-0.4 M KH 2 P0 4 , 0-25 % PEG 3350, 0-10% glycerol. 



Further reservoir solutions containing the conditions listed in Table 1 1 (Figure 6) were also used 

to obtain further crystals of various different 2C9 proteins of the invention. In Table 1 1 , 
25 crystallisation of the clones identified by clone number was obtained by using a reservoir 

solution containing the constituent parts listed in the columns, wherein these are as follows: 

Buffer (M)-Molarity of buffer (in M). 

Buffer-Buffer type. 

pH-pH of buffer used. 
30 Salt (M)-Molarity of salt (in M). 
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Salt-Salt type. 

Ppt (M)-Molarity of precipitant (in M). 
Ppt-Precipitant type. 

Ppt 2 (M)-Molarity of precipitant 2 (in M). 
5 Ppt 2-Precipitant 2 used. 

Add M-Molarity of additive (in M). 
Additive-Additive used. 

Example 21: 2C9-2C19 Chimeras. 

10 Seven further 2C9 proteins were generated which were based upon substitution into 2C9 of 
residues found in 2C19. Three chimeras (1661, 1662 and 1664) were generated by site directed 
mutagenesis as described in Example 17 above and using the primers listed below in Table 14. 
The four other chimeras, 1595, 1600, 1610 and 1632 were generated by cloning methods as 
follows: 

15 

Chimera 1595 

The mutant 1 155 199H was first generated by the Quikchange™ mutagenesis method, using the 
oligonucleotides listed in Table 14. Residues 227 to 339 were then substituted in the construct 
1155 I99H by those present in cytochrome P450 2C 1 9 (clone 1 026) by cloning the Xmal/SphI 
20 , 339-bp DNA fragment of 2C19 into the plasmid pCW-1155 I99H that was opened by the same 
restriction enzymes, to yield the chimera 1595. 

Chimera 1600 

Chimera 1600 was yielded from chimera 1595 by substituting residues 1 to 282 by those found 
25 in the 1 1 55 I99H construct. A silent restriction site EcoRI (underlined) was introduced into the 
1 155 199H construct at position 784 by PCR amplification using the following 
S'oligonucleotides: S'ctttcaatagtgaattcagatggttggttgtgcS' (SEQ ID NO:226) and 
5 , tatggctaagaaaacgagctctaaagggc3 ' (SEQ ID NO:225) with the EcoRI restriction site 
underlined. A total of 28 cycles at 94 °C for 30 sec, 55 °C for 1 min, and 72 °C for 1 min were 
30 followed by an extension of 1 0 min at 72 °C. The 795-bp PCR fragment was double digested 
with Notl/EcoRI and purified by agarose gel extraction and elution. The Notl/EcoRI DNA 
fragment was then cloned into the plasmid 1595 opened by the Notl/EcoRI restriction enzymes 
to yield the 1 155 I99H/1595 chimera. Finally, the L362I change was introduced in the 1 155 
I99H/1595 chimera by the Quikchange™ mutagenesis method, using the oligonucleotides listed 
35 in Table 14, to yield the chimera 1600. 



Chimera 1610 

Chimera 1610 was yielded from the construct 1 1 55 by substituting residues 2 1 5 to 328 by those 
found in the chimera 1600. The BamHI/Afflll DNA fragment was isolated from the chimera 



WO 03/035693 PCT/GB02/04872 

78 

1600 and cloned into the plasmid pCW-1 155 opened with the BamHI/Afflll restriction 
enzymes. 

Chimera J 632 

5 The construct 1632 was yielded from the chimera 1600 by substituting residues 329 to 476 in 
1600 by those found in the construct 1 155. The AfiHI/Sall DNA fragment was isolated from the 
construct 1 155 and cloned into the plasmid pCW-1600 opened with the Afflll/Sall restriction 
enzymes. Table 14 sets out the chimeras. 

1 0 Table 1 4. 2C9-2C 1 9 chimeras 



Clone 


Mutations 


Primers (SEQIDNOs) 


1595 


2C9-FGloop + K206E 
I99HV237LK241E 
Y243DM257I Q261R 
M269I H276Q P279Q 
S286N E288V N289I 
V292A F295L 1331V 


Fw S'ctggaagaggccatttcccactggctgaaagS' (223) 
Rev S'cfficagccagtgggaaatggcctcttccagS' (224) 


1600 


2C9-FGIoop + K206E 
I99H S286N E288V 
N289I V292AF295L 
L3621 133 IV 


Fw S'tatggctaagaaaacgagctctaaagggcS' (225) 
Rev 5'cmcaatagtgaattcagatggttggttgtgc3'[EcoRI] (226) 
Fw S'gagatacattgaccttettcccaccagcctgcS' [L362I] (227) 
Rev 5'gcaggctg#gggaataag#caatgmctc3' (228) 


1610 


2C9-FGloop + K206E 
S286N E288V N2891 
V292A F295L 


(see text) 


1632 


2C9-FGIoop + K206E 
I99H S286N E288V 
N2891 V292A F295L 


(seetext) ; 


1661 


2C9-FGloop + K206E 
I99HS286NN289I 


Fw 5 , ctgaatttactattgaaaacttggaaatcactgcagttgacttgtttgg3' (229) 
Rev 5 , ccaaacaagtcaactgcagtgatttccaagttttcaatagtaaattcag3 , (230) 


1662 


2C9-FGloop + K206E 
S286N 


Fw 5 , gaatttactattgaaaacttggaaaacactgcagttg3' (231) 
Rev 5 , caactgcagtgttttccaagttttcaatagtaaattc3' (232) 


1664 


2C9-FGloop + K206E 
S286NN2891 


Fw 5'gaatttactattgaaaacttggaaaacactgcagttg3' (233) 
Rev S'caactgcagtgtmccaagttttcaatagtaaattcS' (234) 
Fw 5 , ctattgaaagcttggaaatcactgcagttgacttg3 , (235) 
Rev S'caagtcaactgcagtgatttccaagctttcaatagS 1 (236) 



Example 22: Production of 2C9-2C19 chimeras. 

These protein were produced as described above for the 2C9 proteins of Example 1 8 above. 
1 5 Predicted and observed mass spectrometry data for the proteins is listed in Table 15. 
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Table 15. Mass Spectrometry for 2C9 proteins. 



Clone ID 


Calculated Mass (without 
N-met) (Da) 


Observed Mass Pa) 


Mass Diff (without N-met) 
(ppm) 


1595 


53889.79 


53891.6 


1.81 


1600 


53908.92 


53907.07 


-1.85 


1610 


53898.97 


53900.75 


1.78 


1632 


53922.95 


53922.27 


-0.68 ! 


1661 


54015 


54018.32 


3.32 


1662 


53991.97 


53997.11 


5.14 | 


1664 


53908.92 


53910.93 


37.29 



Example 23: Vaiidation of 2C9-FGloop K206E. 

The substrate specificity of 2C9-FGloop K206E was characterized by performing metabolic 
assays with diclofenac as substrate, in combination with inhibition assays with six 
substrates/inhibitors of 2C9 reported in the literature 

The 4-diclofenac hydroxylase assays (Figure 1 1), determined, following the method described by 
Mancy et al t (Biochemistry (1999) 38, 14264-14270) indicate that the Km value of 2C9- 
FGloop K206E mutant for diclofenac is similar to that obtained for the native N-truncated 2C9, 
and falls within the range of values reported in the literature for the native full-length 2C9. 
However, cytochrome P450 2C9-FGloop K206E exhibits a two-fold lower Vmax value that 
may reflect altered interactions with its redox partner. Results from the inhibition studies (Table! 
16) also indicate that the inhibition profile of 2C9-FGloop K206E is unchanged when compared 
to the native N-truncated enzyme, with Ki and IC50 values that match closely those reported in 
the literature. 

These results, altogether, clearly demonstrate that the mutations introduced in the FG loop 
region to promote the crystallization of 2C9 do not change the substrate specificity, nor do they 
modify the integrity of the substrate-binding pocket. Therefore, 2C9-FGloop K206E represents a 
suitable model of the native 2C9 to study the binging mode of chemical compounds into the 
active site. 



Table 16. Activity of 2C9-FGloop K206E (1 155) 



Compound 


2C9 


2C9trunc 


2C9-FGloop 




Published data (u.M) 


lC50/Ki (uM) 


K206E 








IC50/Ki (uM) 


BispHenol A 


Km=4 . 


4.8/2.9 


7.8/3.3 


Fluoxetine 


Ki=13 


1.5/1.5 


2.1/4.1 


Phenytoin 


Ki=6 


250/116 


40/62 


Sulfaphenazole 


Ki=0.5 to 1.6 


0.4/0.9 


1.6/0.93 


4 Phenyl Imidazole 


NA 


3/1.6 


2.8/1.2 


Fluvoxamine 


Ki=2-5 


0.6/0.7 


0.4/0.85 
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Example 24; Activity of 2C9-2C19 Chimeras. 

The substrate specificity of the proteins made in Example 22 was characterized by performing 
inhibition assays with six substrates/inhibitors of 2C19 and 2C9 reported in the literature. 

5 The activity and inhibition assays were performed on the 2C9-2C 1 9 chimeric proteins and the 
results are shown in Table 17. 



Table 17. Activity of 2C9-2C19 chimeras. 





AA 

differences 


7-MFC 
(min-1) 


Sulfaphenazole 
Ki (nM) 


Diclofenac 
Ki(uM) 


Piroxicam 
Ki (uM) 


4-Phenyl 
Imidazole 
Ki OiM) 


Fluvoxamine 
Ki (uM) 


1155 




0.23 


1 


4 


30 


3 


0.4 


2C19 

(published) 


43 


2.98 


>500 


231 


133 


0.5 


1.5 


1595 


15 


3.2 












1600 


8 


3.3 


>500 


>200 


>100 


1 


<! 


1632 


6 


3.31 


>500 


>200 ■ 


>100 


1.1 


1.6 


1661 


3 


2.14 


>500 


>200 


>100 


0.9 


0.5 



10 As can be seen 1 632 and 1 66 1 display 2C 1 9 like activity. Hence, a 2C9-2C 1 9 chimera can also 
be made by making the following changes I99HS286N E288V N289I V292A F295L. An 
alternative minimal mutant is I99H S286N N289I. 

Example 25: Crystallisation of 2C9-2C19 chimeric proteins. 

1 5 Crystals were prepared as described in Example 20 above. The crystals were grown over a 
reservoir solution containing the following conditions: 

0.05-0.1 M Tris-HCl pH 8.0-8.8, 0.1-0.2 M Lithium sulphate, 10-15% PEG 4000; 
0.1 M Tris pH 8.0-8.8, 15-30% PEG 400, 5% PEG 8000, 10% glycerol; and 
0.1-0.4 M KH 2 P0 4 , 0-25 % PEG 3350, 0-10% glycerol; and also the conditions listed in Table 
20 11 (Figure 6). 

Example 26: Homology Modelling of 2C19. 

Using homology modelling, a model of the 2C1 9 protein was produced. The model was 
constructed from an alignment of the 2C9 template structure and the target sequence using 
25 CLUSTALW. The alignments were adjusted with information from the PSIPRED secondary 
structure program and optimised manually. The program MODELLER was used to build and 
optimise the three-dimensional models, with the final model being the one which had the lowest 
energy and closely satisfied the restraints generated by the program. The 2C19 model produced 
is set out in Table 1 8 (Figure 7). 



WO 03/035693 PCT/GB02/04872 

81 

Example 27: Homology modelling of 2C18. 

This was performed by determining the residues that differ in 2C1 8 from 2C19 and using the 
techniques described above in Example 25 to determine the coordinates of those residues. 
These coordinates, set out in Table 1 9 (Figure 8), may be substituted into the 2C9 or 2C 1 9 
5 coordinate tables. 

Example 28; Homology modelling of 2C8. 

This was performed by determining the residues that differ in 2C8 from 2C19 and using the 
techniques described above in Example 25 to determine the coordinates of those residues. 
10 These coordinates, set out in Table 20 (Figure 9), may be substituted into the 2C9 or 2C19 
coordinate tables. 

Summary 

While the invention has been described in conjunction with the exemplary embodiments 
15 described above, many equivalent modifications and variations will be apparent to those skilled 
in the art when given this disclosure. Accordingly, the exemplary embodiments of the invention 
set forth are considered to be illustrative and not limiting. Various changes to the described 
embodiments may be made without departing from the spirit and scope of the invention. 
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Claims: 

1 . A P450 2C9 protein comprising residues 3 1 to 490 of the wild type sequence and which 
comprises the following changes: 

position 220 or position 222 is proline; 
optionally up to 30 other positions are altered; 
and wherein the region N-terminal to position 3 1 is other than wild-type. 

2. The protein of claim 1 wherein position 221 is not proline. 

3. The protein of claim 1 or 2 wherein said up to 30 other positions are altered to introduce 
residues found in the corresponding position in another cytochrome P450 molecule. 

4. The protein of claim 3 wherein said another cytochrome P450 molecule is selected from 
the group consisting of 2C19, 2C18 and 2C8. 

5. The protein of claim 1 which is selected from the group consisting of SEQ ID 
NO:(2x+2), wherein x is an integer from 1 to 52. 

6. The protein of any one of the preceding claims in crystal form. 

7. A crystallisable composition comprising a 2C9 P450 protein complexed to a ligand. 

8. A crystal of P450 2C9 protein having a trigonal space group P32 1 . 

9. The crystal of claim 8 with unit cell dimensions selected from the group: 
a = b= 165.46 A±5%, and c= 111.70 A ± 5%; 

a = b= 16L35A ±5%, and c= 110.75 A±5%;and 
a = b = 163.95A ± 5%, and c = 1 1 1.06 A ± 5%. 

10. A crystal of P450 2C9 protein having a resolution better than 3.1 A. 

11. A crystal of P450 protein having the structure defined by the co-ordinates of Table 1 , 2, 
3 or 8. 

12. A method of making P450 2C9 protein crystals which method comprises the hanging 
drop vapour-diffusion technique, using a precipitant solution comprising 0-0.2 M Tris-HCl (pH 
8-9), 0-0.25 M Li2S04, 0-20% PEG 4000. 

13. A computer-based method for the analysis of the interaction of a molecular structure 
with a P450 structure, which comprises: 
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providing the P450 structure of Table 1, 2, 3, 8 or 18 or selected coordinates thereof; 
providing a molecular structure to be fitted to said P450 structure or selected coordinates 
thereof; and 

fitting the molecular structure to said P450 structure. 

14. The method of claim 13 wherein said selected coordinates include atoms from one or 
more of the residues of Table 4. 

15. The method of claim 14 wherein at least one of said atoms is from a residue of Table 6. 

1 6. The method of any one of claims 1 3 to 1 5 which further comprises the steps of: 
obtaining or synthesising a compound which has said molecular structure; and 
contacting said compound with P450 protein to determine the ability of said compound 

to interact with the P450. 

17. The method of any one of claims 13 to 15 which further comprises the steps of: 
obtaining or synthesising a compound which has said molecular structure; 
forming a complex of a 2C9 P450 protein and said compound; and 
analysing said complex by X-ray crystallography to determine the ability of said 

compound to interact with the P450. 

18. The method of any one of claims 13 to 15 which further comprises the steps of: 
obtaining or synthesising a compound which has said molecular structure; and 
determining or predicting how said compound is metabolised by said P450 structure; and 
modifying the compound structure so as to alter the interaction between it and the P450. 

19. A compound having the modified structure identified using the method of claim 18. 

20. A method of predicting three dimensional structures of P450 homologues or analogues 
of unknown structure, the method comprises the steps of: 

aligning a representation of an amino acid sequence of a target P450 protein of unknown 
three-dimensional structure with the amino acid sequence of the P450 of Table 1, 2, 3, 8 or 18 to 
match homologous regions of the amino acid sequences; 

modelling the structure of the matched homologous regions of said target P450 of 
unknown structure on the corresponding regions of the P450 structure as defined by Table 1, 2, 
3, 8 or 18; and 

determining a conformation for said target P450 of unknown structure which 
substantially preserves the structure of said matched homologous regions. 
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21. The method of claim 20 wherein said target P450 protein is selected from the group 
consisting of 2C8, 2C18 and2C19. 

22. A chimaeric protein having a binding cavity which provides a substrate specificity 
substantially identical to that of P450 2C9 protein, 

wherein the chimaeric protein binding cavity is lined by a plurality of atoms which 
correspond to selected P450 2C9 atoms lining the P450 2C9 binding cavity, the relative 
positions of said plurality of atoms corresponding to the relative positions, as defined by Table 
1 , 2, 3 or 8, of said selected P450 2C9 atoms. 

23. A method for determining the structure of a protein, which method comprises; 
providing the co-ordinates of Table 1, 2, 3, 8 or 18 or selected coordinates thereof, and 
either (a) positioning said co-ordinates in the crystal unit cell of said protein so as to 

provide a structure for said protein, or (b) assigning NMR spectra peaks of said protein by 
manipulating said co-ordinates. 

24. A method for determining the structure of a compound bound tp P450 protein, said 
method comprising: 

providing a crystal of P450 protein; 

soaking the crystal with the compound to form a complex; and 
determining the structure of the complex by employing the data of Table 1, 2, 3, 8 or 18 
or a portion thereof. 

25. A method for determining the structure of a compound bound to P45.0 protein, said 
method comprising: 

mixing P450 protein with the compound; 
crystallizing a P450 protein-compound complex; and 

determining the structure of the complex by employing the data of Table 1, 2, 3, 8 or 18 
or a portion thereof. 

26. A method of assessing the ability of a compound to interact with P450 2C9 protein 
which comprises: 

obtaining or synthesising said compound; 

forming a crystallised complex of a P450 2C9 protein and said compound, said complex 
diffracting X-rays for the determination of atomic coordinates of said complex to a resolution of 
better than 3. 1 A; and 

analysing said complex by X-ray crystallography to determine the ability of said 
compound to interact with the P450 2C9 protein. 
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27. A computer system, intended to generate structures and/or perform optimisation of 
compounds which interact with P450, P450 homologues or analogues, complexes of P450 with 
compounds, or complexes of P450 homologues or analogues with compounds, the system 
containing computer-readable data comprising one or more of: 

(a) atomic coordinate data according to Table 1, 2, 3, 8 or 18, said data defining the 
three-dimensional structure of P450 or at least selected coordinates thereof; 

(b) structure factor data for P450, said structure factor data being derivable from the 
atomic coordinate data of Table 1 , 2, 3, 8 or 1 8; 

(c) atomic coordinate data of a target P450 protein generated by homology modelling of 
the target based on the data of Table 1, 2, 3, 8 or 1 8; 

(d) atomic coordinate data of a target P450 protein generated by interpreting X-ray 
crystallographic data or NMR data by reference to the data of Table 1 , 2, 3, 8 or 1 8; and 

(e) structure factor data derivable from the atomic coordinate data of (c) or (d). 

28. A computer system according to claim 27, wherein said atomic coordinate data is for at 
least one of the atoms provided by the residues of Table 4. 

29. A computer system according to claim 27 or 28 comprising: 

(i) a computer-readable data storage medium comprising data storage material encoded 
with said computer-readable data; 

(ii) a working memory, for storing instructions for processing said computer-readable 
data; and 

(iii) a central-processing unit coupled to said working memory and to said computer- 
readable data storage medium for processing said computer-readable data and thereby 
generating structures and/or performing rational drug design. 

30. A computer system according to claim 29 further comprising a display coupled to said 
central-processing unit for displaying said structures. 

31. A method of providing data for generating structures and/or performing optimisation of 
compounds which interact with P450, P450 homologues or analogues, complexes of P450 with 
compounds, or complexes of P450 homologues or analogues with compounds, the method 
comprising: 

(i) establishing communication with a remote device containing computer-readable data 
comprising at least one of: (a) atomic coordinate data according to Table 1, 2, 3, 8 or 18, said 
data defining the three-dimensional structure of P450, or the coordinates of a plurality of atoms 
of P450; (b) structure factor data for P450, said structure factor data being derivable from the 
atomic coordinate data of Table 1, 2, 3, 8 or 18; (c) atomic coordinate data of a target P450 
homologue or analogue generated by homology modelling of the target based on the data of 
Table 1, 2, 3, 8 or 18; (d) atomic coordinate data of a protein generated by interpreting X-ray 
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crystallographic data or NMR data by reference to the data of Table 1, 2, 3, 8 or 18; and (e) 
structure factor data derivable from the atomic coordinate data of (c) or (d); and 
(ii) receiving said computer-readable data from said remote device. 

32. A computer-readable storage medium comprising a data storage material encoded with 
computer-readable data, wherein the data are defined by: 

(a) atomic coordinate data according to Table 1, 2, 3, 8 or 18, said data defining the 
three-dimensional structure of P450 or at least selected coordinates thereof; 

(b) structure factor data for P450, said structure factor data being derivable from the 
atomic coordinate data of Table 1, 2, 3, 8 or 1 8; 

(c) atomic coordinate data of a target P450 protein generated by homology modeling of 
the target based on the data of Table 1, 2, 3, 8 or 18;' 

(d) atomic coordinate data of a target P450 protein generated by interpreting X-ray 
crystallographic data or NMR data by reference to the data of Table 1 , 2, 3, 8 or 1 8; and 

(e) structure factor data derivable from the atomic coordinate data of (c) or (d). 

33. A computer-readable storage medium according to claim 32, wherein said atomic 
coordinate data is for at least one of the atoms provided by the residues of Table 4. 

34. A computer-readable storage medium, comprising a data storage material encoded with 
computer readable data, wherein the data are defined by all or a portion of the structure 
coordinates of the P450 protein of Table 1, 2, 3, 8 or 18, or a homologue of P450, wherein said 
homologue comprises backbone atoms that have a root mean square deviation from the 
backbone atoms of Table 1, 2, 3, 8 or 18 of not more than 2.0 A. 

35. A computer-readable storage medium comprising a data storage material encoded with a 
first set of computer-readable data comprising a Fourier transform of at least a portion of the 
structural coordinates for the P450 protein according to Table 1, 2, 3, 8 or 1 8; which data, when 
combined with a second set of machine readable data comprising an X-ray diffraction pattern of 
a molecule or molecular complex of unknown structure, using a machine programmed with the 
instructions for using said first set of data and said second set of data, can determine at least a 
portion of the structure coordinates corresponding to the second set of machine readable data. 

36. A nucleic acid comprising a sequence coding for the 2C9 protein of any one of claims 1 
to 5. 

37. A vector comprising the nucleic acid of claim 36 operably linked to a promoter. 



38. A host cell carrying the vector of claim 37. 
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39. A method of making the protein of any one of claims 1 to 5 comprising culturing the 
host cell of claim 38. under conditions to express the protein, and recovering the protein. 

40. The method of claim 39 which comprises: 

(a) expressing in a host cell culture said cytochrome 2C9 P450 molecule; 

(b) recovering said cells from said culture and suspending said cells in salt buffer having 
a conductivity of from 12 to 1 10 mS/cm; 

(c) lysing said cells and removing cell debris to provide a high-salt lysate; 

(d) adding detergent to said lysate (for example 0.015% to 1.2% v/v) to provide a high- 
salt-detergent lysate; 

(e) recovering said P450 from said lysate. 

4 1 . The method of claim 40 wherein step (e) is performed by: 
(e(i)) binding said 2C9 P450 to an affinity support; 
(e(ii)) rinsing said support in a high-salt-detergent wash; 

(e(iii)) removing said 2C9 P450 in a high-salt-detergent buffer to provide a P450-high- 
salt-detergent preparation; followed by 

(f) exchanging the buffer to a low ionic strength buffer without detergent by size- 
exclusion chromatography to provide a P450-low-salt preparation. 

42. The method of claim 41 which further comprises crystallising the protein. 

43. The method of claim 42 wherein the protein is crystallised by the hanging drop method 
of claim 12 to provide a crystal. 

44. The method of claim 43 which further comprises obtaining an X-ray diffraction pattern 
of the crystal. 

45. Use of the atomic coordinate data or selected coordinates thereof of any one of Tables 1 , 
2, 3, 8 or 1 8 for the provision of a computer-generated structure of a cytochrome P450 molecule 
bound to a ligand. 
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86.271 22.664 1.00 67.17 A N 

86.433 22.465 1.00 69.41 A C 

85.650 21.601 1.00 69.86 A N 

87.376 23.126 1.00 70.46 A N 

86.103 23.822 1.00 47.60 A C 

85,060 23.200 1.00 47.93 A 0 

87.220 23.273 1.00 46.76 A N 

87.293 21.875 1.00 45.50 A C 

88.740 21.516 1.00 46.21 A C 
88.892 20.209 1.00 45.92 A C 

88.741 20.115 1.00 46.34 A C 
89.121 19.059 1.00 44.48 A C 
88.811 18.891 1.00 47.70 A C 
89.192 17.836 1.00 44.86 A C 
89.036 17.744 1.00 46.97 A C 
86.414 21.541 1.00 45.10 A C 
85.723 20.527 1.00 45.58 A 0 
86.465 22.396 1.00 44.90 A N 
85.689 22.216 1.00 43.69 A C 

85.938 23.383 1.00 43.27 A C 
87.293 23.436 1.00 41.41 A 0 
84.208 22.127 1.00 43.64 A C 
83.515 21.235 1.00 43.64 A 0 
83.722 23.059 1.00 43.63 A N 
82.319 23.067 1.00 45.47 A C 
82.005 24.291 1.00 43.12 A C 
81.732 25.525 1.00 41.00 A C 
81.569 26.704 1.00*39.88 A C 
80.491 25.321 1.00 38.44 A C 
81.642 21.809 1.00 4B.12 A C 
80.780 21.285 1.00 49.59 A 0 
82.612 21.315 1.00 50.88 A N 
82.220 20.108 1.00 52.84 A C 
83.018 19.968 1.00 55.20 A C 
82.826 21.146 1.00 60.77 A C 
83.278 20.802 1.00 66.89 A S 
84.979 20.156 1.00 65.46 A C 
82.350 18.839 1.00 52.57 A C 
82.058 17.753 1.00 53.90 A 0 
82.789 18.974 1.00 52.46 A N 
82.912 17.81B 1.00 52.38 A C 
84.276 17.772 1.00 53.59 A C 
84.387 18.897 1.00 55.82 A 0 
85.428 17.789 1.00 53.91 A C 
81.825 17.907 1.00 51.33 A C 
81.425 16.897 1.00 52.08 A 0 
81.365 19.129 1.00 49.85 A N 
80.325 19.376 1.00 48.34 A C 
80.449 20.791 1.00 47.37 A C 
81.407 20.941 1.00 46.87 A C 
81.735 22.401 1.00 46.14 A C 
80.767 20.318 1.00 47.51 A C 
78.948 19.177 1.00 47.86 A C 

77.939 19.519 1.00 47.61 A 0 
78.918 18.633 1.00 48.01 A N 
77.667 18.362 1.00 48.97 A C 
77.938 18.068 1.00 51.30 A C 
78.854 19.062 1.00 53.99 A C 
79.065 18.629 1.00 58.79 A C 
79.211 17.171 1.00 63.21 A N 
79.891 16,516 1.00 64.60 A C 
79.946 15.190 1.00 65.17 A N 
80.534 17.180 1.00 65.96 A N 
77.094 17.124 1.00 47.80 A C 
77.850 16.239 1.00 48.15 A 0 
75.775 17.043 1.00 46.88 A N 
75.154 15.891 1.00 46.90 A C 
73.639 15.903 1.00 46.63 A C 
72.882 15.162 1.00 47.62 A C 
71.727 14.785 1.00 49.61 A O 
73.528 14.962 1.00 47.53 A N 
75.716 14.530 1.00 47.39 A C 
75.594 13.547 1.00 46.25 A 0 
76.318 14.483 1.00 49.02 A N 
76.916 13.260 1.00 50.18 A C 
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75.993 12.636 1.00 49.20 A C 

74.691 12.152 1.00 49.20 A C 

74.611 10.931 1.00 49.32 A C 

73.556 12.962 1.00 48.67 A C 

73.420 10.531 1.00 48.88 A C 
72.372 12.570 1.00 46.13 A C 
72.304 11.358 1.00 47.35 A C 
78.261 13.560 1.00 51.73 A C 
78.745 12.776 1.00 53.08 A O 
78.858 14.693 1.00 52.77 A N 
80.132 15.065 1.00 55.56 A C 
81.282 14.145 1.00 58.23 A C 
82.436 14.434 1.00 59.36 A O 
80.986 13.038 1.00 59.36 A N 
82.024 12.085 1.00 60.33 A C 
83.003 12.732 1.00 58.84 A C 
82.345 13.424 1.00 58.09 A C 
83.477 14.301 l.OO 57.30 A S 
82.763 13.873 1.00 54.57 A C 

81.421 10.818 1.00 61.93 A C 
80.287 10.829 1.00 62.78 A O 
82.172 9.724 1.00 62.78 A N 
81.702 8.465 1.00 64.09 A C 
80.383 7.933 1.00 64.96 A C 
79.946 8.262 1.00 65.79 A O 
79.759 7.080 1.00 65.58 A N 
78.475 6.470 l.OO 65.76 A C 
78.588 4.931 1.00 68.51 A C 
79.196 4.347 1.00 71.45 A C 
78.709 2.907 1.00 73.91 A C 
79.076 1.925 1.00 75.37 A C 
78.559 0.533 1.00 75.79 A N 
77.513 6.887 1.00 63.97 A C 
76.335 6.524 1.00 64.44 A O 
78.046 7.649 1.00 60.95 A N 
77.281 8.138 1.00 57.95 A C 
78.109 7.955 1.00 57.45 A C 
77.358 8.173 1.00 58.21 A C 
78.181 7.671 1.00 58.90 A C 
77.523 7.666 1.00 60.11 A N 
76.300 7.438 1.00 60.54 A C 

75.807 7.669 1.00 60.64 A. N 
75.563 6.791 1.00 61.00 A N 
77.009 9.616 l.OO 56.35 A C 
77.881 10.318 1.0056. 84 A O 
75.800 10.081 1.00 53.54 A N 
75.425 11.472 1.00 50.13 A C 
73.974 11.546 1.00 49.35 A C 
73.178 11.191 1.00 4B.55 A 0 
75.556 12.337 1.00 48.08 A C 
75.389 11.864 1.00 4B.22 A 0 

75.808 13.620 1.00 44.92 A N 
75.911 14.570 1.00 41.97 A C 
76.084 15.977 1.00 41.15 A C 
75.906 17.037 1.00 39.54 A C 
77.458 16.067 1.00 42.31 A C 
77.715 17.373 1.00 45.27 A C 
74.648 14.508 1.00 40.77 A C 
74.686 14.686 1.00 40.02 A O 
73.525 14.255 1.00 38.02 A N 
72.285 14.181 1.00 36.75 A C 
71.116 14.145 1.00 37.54 A C 
69.813 14.360 1.00 38.52 A C 
68.644 14.414 1.00 38.04 A C 
68.765 15.022 1.00 38.75 A O 
67.594 13.856 1.00 38.20 A O 
72.246 12.949 1.00 36.94 A C 
71.630 12.947 1.00 34.74 A O 
72.884 11.8B4 1.00 37.02 A V 
72.915 10.671 1.00 38.35 A C 
73.578 9.550 1.00 41.42 A C 
72.573 8.623 1.00 45.89 A C 
71.504 8.440 1.00 47.43 A O 
72.853 8.078 1.00 47.48 A O 
73.665 10.905 1.00 38.09 A C 



SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



12/514 



PCT/GB02/04872 



Figure 1 



ATOM 


822 


0 


ASP 


A 


143 


-23.436 


ATOM 


823 


N 


ARG 


A 


144 


-24.549 


ATOM 


824 


CA 


ARG 


A 


144 


-23.431 


ATOM 


825 


CB 


ARG 


A 


144 


-23.933 


ATOM 


626 


CG 


ARG 


A 


144 


-25.028 


ATOM 


82*7 


CD 


ARG 


A 


144 


-25.226 


ATOM 


828 


NE 


ARG 


A 


144 


-25.520 


ATOM 


829 


CZ 


ARG 


A 


144 


-26.435 


ATOM 


830 


NH1 


ARG 


A 


144 


-27.172 


ATOM 


831 


NH2 


ARG 


A 


144 


-26.620 


ATOM 


832 


C 


ARG 


A 


144 


-22.474 


ATOM 


833 


0 


ARG 


A 


144 


-21.265 


ATOM 


834 


N 


VAL 


A 


145 


-23.028 


ATOM 


835 


CA 


VAL 


A 


145 


-22.175 


ATOM 


836 


CB 


VAL 


A 


145 


-22.968 


ATOM 


837 


CGI 


VAL 


A 


145 


-22.102 


ATOM 


83B 


CG2 


VAL 


A 


145 


-23.456 


ATOM 


839 


C 


VAL 


A 


145 


-21.550 


ATOM 


840 


0 


VAL 


A 


145 


-20.401 


ATOM 


841 


N 


GLN 


A 


146 


-22.402 


ATOM 


842 


CA 


GLN 


A 


146 


-21.913 


ATOM 


843 


CB 


GLN 


A 


146 


-23.043 


ATOM 


844 


CG 


GLN 


A 


146 


-23.651 


ATOM 


845 


CD 


GLN 


A 


146 


-25.016 


ATOM 


846 


OE1 


GLN 


A 


146 


-25.574 


ATOM 


847 


NC2 


GLN 


A 


146 


-25,566 


ATOM 


848 


C 


GLN 


A 


146 


-20.836 


ATOM 


849 


0 


GLN 


A 


146 


-19.875 


ATOM 


850 


N 


GLU 


A 


147 


-20.987 


ATOM 


851 


CA 


GLU 


A 


147 


-20.011 


ATOM 


852 


CB 


GLU 


A 


147 


-20.537 


ATOM 


853 


CG 


GLU 


A 


147 


-19.736 


ATOM 


854 


CD 


GLU 


A 


147 


-19.530 


ATOM 


855 


OE1 


GLU 


A 


147 


-20.259 


ATOM 


856 


OE2 


GLU 


A 


147 


-18.643 


ATOM 


857 


C 


GLU 


A 


147 


-18.731 


ATOM 


858 


0 


GLU 


A 


147 


-17.643 


ATOM 


859 


N 


GLU 


A 


148 


-18.879 


ATOM 


860 


CA 


GLU 


A 


148 


-17.747 


ATOM 


861 


CB 


GLU 


A 


148 


-18.233 


ATOM 


862 


CG 


GLU 


A 


148 


-17.145 


ATOM 


863 


CD 


GLU 


A 


148 


-16.549 


ATOM 


864 


OE1 


GLU 


A 


148 


-17.309 


ATOM 


865 


OE2 


GLU 


A 


148 


-15.317 


ATOM 


866 


C 


GLU 


A 


148 


-17.012 


ATOM 


867 


0 


GLU 


A 


148 


-15.798 


ATOM 


868 


N 


ALA 


A 


149 


-17.755 


ATOM 


869 


CA 


ALA 


A 


149 


-17.190 


ATOM 


870 


CB 


ALA 


A 


149 


-18.313 


ATOM 


871 


C 


ALA 


A 


149 


-16.330 


ATOM 


872 


0 


ALA 


A 


149 


-15.312 


ATOM 


873 


N 


ARG 


A 


150 


-16.766 


ATOM 


874 


CA 


ARG 


A 


150 


-16.060 


ATOM 


875 


CB 


ARG 


A 


150 


-16.901 


ATOM 


876 


CG 


ARG 


A 


150 


-16.430 


ATOM 


877 


CD 


ARG 


A 


150 


-16.508 


ATOM 


878 


NE 


ARG 


A 


150 


-15.353 


ATOM 


879 


CZ 


ARG 


A 


150 


-15.309 


ATOM 


880 


NH1 


ARG 


A 


150 


-16.366 


ATOM 


881 


NH2 


ARG 


A 


150 


-14.208 


ATOM 


882 


C 


ARG 


A 


150 


-14.709 


ATOM 


883 


0 


ARG 


A 


150 


-13.703 


ATOM 


884 


N 


CYS 


A 


151 


-14.692 


ATOM 


685 


CA 


CYS 


A 


151 


-13.472 


ATOM 


886 


CB 


CYS 


A 


151 


-13.826 


ATOM 


887 


SG 


CYS 


A 


151 


-14.939 


ATOM 


888 


C 


CYS 


A 


151 


-12.553 


ATOM 


889 


0 


CYS 


A 


151 


-11.336 


ATOM 


890 


N 


LEU 


A 


152 


-13.143 


ATOM 


891 


CA 


LEU 


A 


152 


-12.379 


ATOM 


892 


CB 


LEU 


A 


152 


-13.319 


ATOM 


893 


CG 


LEU 


A 


152 


-12.799 


ATOM 


894 


CD1 


LEU 


A 


152 


-13.939 


ATOM 


895 


CD2 


LEU 


A 


152 


-11.635 


ATOM 


896 


C 


LEU 


A 


152 


-11.585 



73.378 10.297 1.00 39.21 A O 

74.640 11.799 1.00 37.15 A N 

75.480 12.162 1.00 35.62 A C 

76.702 12.901 1.00 35.22 A C 

77.363 12.167 1.00 36.79 A C 

78.804 12.568 1.00 36.64 A C 

79.558 11.356 1.00 37.59 A N 
80.508 11.256 1.00 38.39 A C 
80.846 12.306 1.00 39.75 A N 
81.103 10.091 1.00 39.75 A N 
74.721 13.041 1.00 35.95 A C 
74.718 12-801 1.00 37.70 A 0 
74.111 14.073 1.00 15.00 A 
73.315 14.947 1.00 15.00 A 
72.757 16.146 1.00 15.00 A , 
71.794 16.943 1.00 15.00 A 
73.895 17.027 1.00 15.00 A 
72.150 14.187 1.00 15.00 A 
71.868 14.315 1.00 32.67 A 
71.551 13.366 1.00 34.18 A N 
70.453 12.562 1.00 35.76 A C 
69.844 11.754 1.00 36.99 A C 
68.638 12.410 1.00 39.76 A C 
68.305 11.871 1.00 41.26 A C 
67.270 12.199 1.00 44.85 A 0 
69.177 11.045 1.00 41.78 A N 
70.947 11.632 1.00 35.69 A C 
70.236 11.361 1.00 34.57 A O 
72.175 11.152 1.00 36.94 A N 
.72.717 10.229 l.OO 38.92 - A C 
73.952 9.549 1.00 39.06 A C 
74.249 8.323 1.00 42.53 A C 
75.714 8.117 1.00 44.76 A C 
76.501 8.753 1.00 46.09 A 0 
76.082 7.312 1.00 47.73 A 0 
73.089 10.918 1.00 40.05 A C 
72.974 10.352 1.00 40.31 A O 

73.566 12.145 1.00 40.90 A N 
73.986 12.932 1.00 40.72 A C 
74.815 14.113 1.00 41.46 A C 
75.645 14.760 1.00 44.00 A C 
76.691 13.809 1.00 44.92 A C 
77.561 13.316 1.00 43.26 A 0 
76.632 13.566 1.00 44.21 A O 
72.745 13.409 1.00 40.38 A C 
72.761 13.549 1.00 40.91 A O 
71.665 13.644 1.00 40.56 A N 
70.397 14.114 1.00 40.37 A C 
69.447 14.510 1.00 38.08 A C 
69.759 13.049 1.00 40.96 A C 
69.137 13.348 1.00 40.04 A O 
69.905 11.800 1.00 44.44 A N 
69.353 10.640 1.00 45.97 A C 

69.567 9.373 1.00 49.29 A C 
68.798 8.131 . 1.00 54.67 A C 
69.664 6.844 1.00 57.99 A C 

70.559 6.736 1.00 61.38 A N 
71.667 6.003 1.00 62.75 A C 
72.040 5.291 1.00 64.70 A N 
72.411 6.002 1.00 62.86 A N 
70.066 10.521 1.00 44.34 A C 
69.442 10.215 1.00 43.79 A 0 
71.367 10.801 1.00 42.95 A N 
72.163 10.724 1.00 42.58 A C 
73.635 10.629 1.00 42.73 A C 
73.927 9.256 1.00 50.77 A S 
71.935 11.913 1.00 41.82 A C 
71.991 11.788 1.00 41.84 A 0 
71.686 13.072 1.00 41.86 A N 
71.432 14.272 1.00 41.07 A C 
71.225 15.451 1.00 41.54 A C 
71.468 16.874 1.00 42.23 A C 
71.219 17.847 1.00 43.20 A C 
70.577 17.205 1.00 42.24 A C 
70.163 14.030 1.00 42.07 A C 
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87.912 43.563 1.00 34.02 A O 

87.723 42.929 1.00 35.79 A N 

88.778 43.819 1.00 37.21 A C 

90.133 43.322 1.00 35.38 A C 

88.727 43.811 1.00 39.20 A C 
88.753 42.742 1.00 40.02 A O 

88.633 44.988 1.00 41.55 A N 
88.588 45.064 1.00 44.82 A C 
88.325 46.496 1.00 44.24 A C 
86.976 46.979 1.00 43.13 A C 
89.427 47.409 1.00 44.26 A C 
89.900 44.579 1.00-46.76 A C 
90.945 44.679 1.00 47.48 A O 
89.843 44.054 1.00 49.35 A N 
91.040 43.544 1.00 52.61 A C 
90.777 42.177 1.00 53.22 A C 
89.441 41.761 1.00 52.18 A O 
91.815 41.159 1.00 54.60 A C 
91.479 44.436 1.00 54.39 A C 
92.322 44.054 1.00 53.90 A O 
90.882 45.614 1.00 57.58 A N 
91.151 46.562 1.00 61.26 A C 
90.174 46.366 1.00 62.07 A C 
90.266 44.816 1.00 66.07 A S 
90.893 47.933 1.00 63.31 A C 

90.507 48.088 1.00 64.75 A O 
91.062 48.933 1.00 64.45 A N 
90.804 50.281 1.00 65.63 A C 
91.832 51.230 1.00 66.44 A C 
93.130 51.184 1.00 67.17 A C 

93.508 50.079 1.00 67.20 A O 
93.769 52.245 1.00 69.25 A O 
69.413 50.684 1.00 65.78 A C 
89.209 51.793 1.00 67.57 A O 
88.446 49.797 1.00 64.68 A N 
87.078 50.107 1.00 64.44 A C 
86.096 48.987 1.00 62.54 A C 
86.523 47.669 1.00 62.77 A C 
86.074 48.846 1.00 61.31 A C 
85.110 49.753 1.00 61.00 A C 
86.594 51.442 1.00 65.35 A C 
87.110 51.933 1.00 65.09 A O 
85.620 52.037 1.00 65.89 A N 
85.032 53.293 1.00 66.45 A C 
85.182 54.391 1.00 67.20 A C 
84.963 53.927 1.00 69.15 , A C 
86.154 53.111 1.00 70.92 A C 
87.389 54.007 1.00 72.48 A C 

88.634 53.269 1.00 72.49 A N 
83.562 53.041 1.00 66.05 A C 
82.687 53.439 1.00 68.24 A O 
83.294 52.358 1.00 64.23 A N 
81.930 52.057 1.00 62.63 A C 
81.918 51.352 1.00 59.42 A C 
80.586 50.843 1.00 55.85 A C 
79.970 49.844 1.00 55.27 A C 
79.944 51.359 1.00 54.53 A C 

78.728 49.368 1.00 55.27 A C 
78.716 50.896 1.00 53.88 A C 
78.100 49.898 1.00 55.99 A C 
81.091 53.329 1.00 63.77 A C 
81.497 54.312 1.00 63.38 A O 
79.918 53.300 1.00 65.04 A N 
79.024 54.451 1.00 65.51 A C 
78.646 54.825 1.00 63.50 A C 
77.687 53.848 1.00 61.32 A C 
76.263 54.207 1.00 58.82 A C 
75.282 53.938 1.00 55.06 A N 
74.810 52.733 1.00 53.84 A C 
75.237 51.679 1.00 54.33 A N 
73.888 52.585 1.00 53.97 A N 
79.721 55.615 1.00 67.10 A C 
79.486 55.885 1.00 68.79 A O 
80.586 56.305 1.00 67.34 A N 
81.310 57.413 1.00 67.66 A C 



SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



37/514 



PCT/GB02/04872 



Figure 1 
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80.331 58.517 1.00 68.64 A C 

80.749 59.230 1.00 70.93 A C 

80.929 .60.455 1.00 72.31 A 0 

60.902 56 .468 1.00 70.66 A N 

82.314 57.924 1.00 67.57 A C 

82.663 59.105 1.00 67.52 A 0 
82.778 57.016 1.00 67.16 A N 
83.723 57.363 1.00 65.23 A C 
83.170 56.932 1.00 64.54 A C 
82.012 57.777 1.00 64.45 A C 
82.212 59.079 1.00 64.40 A C 
81.158 59.870 1.00 64.90 A C 
80.714 57.277 1.00 64.82 A C 

79.645 58.063 1.00 65.18 A C 
79.877 59.362 1.00 65.33 A C 
78.831 60.162 1.00 66.76 A 0 
85.127 56.811 1.00 65.32 A C 
86.050 57.198 1.00 66.83 A 0 
85.293 55.903 1.00 63.55 A N 
86.616 55.355 1.00 61.98 A . C 
87.482 56.440 1.00 61.61 A C 
88.959 56.192 1.00 61.77 A C 
89.161 54.957 1.00 62.33 A C 
89.499 57.427 1.00 62.30 A C 
87.366 54.777 1.00 61.24 A C 
88.029 55.505 1.00 61.52 A O 
87.275 53.460 1.00 60.09 A N 
87.952 52.770 1.00 57.67 A C 
87.052 51.718 1.00 57.13 A C 
87.693 51.242 1.00 56.08 A C 
85.662 52.300 1.00 56.63 A C 
84.653 51.283 1.00 56.68 A C 
89.170 52.054 1.00 56.74 A C 
89.049 51.218 l.OO 55.80 A 0 
90.366 52.380 1.00 55.87 A N 

90.580 53.391 1.00 54.93 A C 
91.644 51.795 1.00 54.53 A C 

92.664 52.644 1.00 53.78 A C 
91.933 53.023 1.00 54.61 A C 
91.786 50.304 l.OO 54.13 A C 
91.203 49.763 1.00 53.18 A O 
92.567 49.621 1.00 54.37 A N 
92-787 48.191 1.00 54.48 A C 
93.732 47.668 1.00 56.00 A C 
94.040 46.183 1.00 59.08 A C 
95.055 45.710 1.00 61.92 A C 
96.381 46.449 1.00 64.65 A C 
97.420 46.017 1.00 67.40 A N 
93.344 47.833 1.00 52.99 A C 
94.165 48.555 1.00 54.31 A ' 0 
92.885 46.709 1.00 51.34 A N 
93.356 46.261 1.00 49.14 A C 

92.581 46.791 1.00 46.48 A C 
92.670 46.213 1.00 49.09 A 0 
91.822 47.873 1.00 47.00 A N 
91.034 48.487 1.00 44.87 A C 
90.197 49.685 1.00 45.93 A C 
91.009 50.557 1.00 47.57 A 0 
89.630 .50.457 1.00 46.50 A C 
90.044 47.535 1.00 42.49 A C 
89.241 46.888 1.00 41.61 A 0 
90.079 47.473 1.00 39.60 A N 
89.157 46.602 1.00 37.10 A C 
89.561 46.451 1.00 37.22 A C 

90.646 45.516 1.00 38.65 A 0 
88.404 45.948 1.00 36.65 A C 
87.712 47.070 1.00 34.29 A C 
87.413 48.251 1.00 33.82 A 0 
86.814 46.120 1.00 32.63 A N 
85.410 46.442 1.00 30.30 A C 
84.897 46.055 1,00 29.55 A C 
83,442 46.435 1.00 28.60 A C 
85.741 46.721 1.00 30.52 A C 
85.721 48.221 1.00 29.87 A C 
64.578 45.675 1.00 29.54 A C 



SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



38/514 



PCT/GB02/04872 



Figure 1 
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37.686 19.995 1.00 54.82 B N 

37.944 21.416 1.00 53.44 B C 

39.366 21.721 1.00 51.44 B C 

39.656 20.931 1.00 47.97 B C 

39.487 23.210 1.00 50.28 B C 

36.942 21.982 1.00 54.92 B C 

36.224 21.239 1.00 57.43 B 0 

36.896 23.303 1.00 55.01 B N 

35.976 23.965 1.00 54.91 B C 

36.285 23.611 1.00 51.81 B C 

37.576 24.126 1.00 50.25 B C 

38.503 23.275 1.00 50.28 B C 

37.882 25.463 1.00 50.54 B C 

39.726 23.753 1.00 50,20 B C 

39.094 25.958 1.00 51.24 B C 

40.026 25.102 1.00 52.11 B C 

34.566 23.526 1.00 56.67 B C 

33.822 23.338 1.00 58.74 B 0 

34.167 23.335 1.00 58.18 B N 

32.794 22.906 1.00 60.31 B C 
32.747 21.393 1.00 59.40 B C 
32.576 21.089 1.00 61.82 B 0 
31.992 23.663 1.00 62.32 B C 
32.524 24.065 1.00 60.79 B 0 
30.704. 23.868 1.00 66.51 B N 

29.765 24.581 1.00 69.32 B C 
28.313 24.445 1.00 70.77 B C 
27.445 25.690 1.00 72.73' B C 
27.476 26.267 1.00 73.36 B 0 
26.652 26.088 1.00 73.16 B N 
29.866 23.964. 1.00 69.09 B C 
29.777 22.753 1.00 67.03 B 0 
30.036 24.784 1.00 15.00 B N 
30.201 24.364 1.00 15.00 B C 
29.311 23.189 1.00 15.00 B C 
29.811 22.202 1.00 71.45 B O 
28.005 23.225 1.00 62.90 B N 
27.093 22.138 1.00 61.34 B C 
25.691 22.441 1.00 61.29 B C 
27.571 20.766 1.00 61.36 B C 

27.795 19.850 1.00 64.60 B 0 
27.6B0 20.626 1.00 15.00 B N 
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26.766 19.482 1.00 15.00 B C 
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32.240 21.665 1.00 51.94 B C 
33.598 21.782 1.00 49.03 B C 
33.961 22.434 1.00 47.45 B C 
35.353 22.272 1.00 46.75 B C 
33.248 23.149 1.00 45.26 B C 
34,744 21.263 1.00 47.88 B C 
35.804 21.549 1.00 48.19 B N 
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31.483 22.660 1.00 67.60 B C 
32.417 22.592 1.00 69.23 B N 
31.345 18.011 1.00 56.27 B C 
32.289 17,384 1.00 56.64 B O 
30.306 17.422 1.00 55.63 B N 
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63.866 14.880 1.00 33.10 B C 

64.599 11.253 1.00 28.98 B C 

65.751 11.115 1.00 28.26 B 0 

64.107 10.602 1.00 30.03 B N 

64.935 9.768 1.00 30.53 B C 

64.139 9.486 1.00 30.83 B C 

65.011 8.974 1.00 34.66 B 0 
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66.707 6.165 1.00 33.03 B 0 

65.742 8.151 1.00 32.71 B N 
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68.068 8.744 1.00 35.55 B C 
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71.442 7.755 1.00 42.60 B C 
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71.213 5.635 1.00 39.70 B C 
69.251 6.599 1.00 39.82 B C 
70.449 4.603 1.00 39.52 B C 
68.481 5.574 1.00 39.13 B C 
69.079 4.572 1.00 38.68 B C 
72.836 B.642 1.00 45.04 B C 
72.488 9.114 1.00 44.44 B O 
73.771 9.137 1.00 15.00 B N 
74.434 10.419 1.00 15.00 B C 
75.939 10.216 1.00 15.00 B C 
76.278 9.599 1.00 15.00 B C 
75.735 9.994 1.00 15.00 B O 

77.183 8.628 1.00 15.00 B N 

74.184 11.354 1.00 15.00 B C 
74.327 10.853 1.00 50.60 B O 
73.766 12.520 1.00 52.02 B N 
73.428 13.407 1.00 54.10 B C 

72.376 14.398 1.00 52.43 B C 
71.238 13.789 1.00 50.99 B C 
70.283 14.898 1.00 50.86 B C 
70.505 12.719 1.00 49.00 B C 
74.620 14.148 1.00 56.41 B C 
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75.087 15.133 1.00 57.85 B O 

75.126 13.667 1.00 15.00 B N 

76.227 14.357 1.00 15.00 B C 

77.169 13.346 1,00 15.00 ' B C 

77.975 12.516 1.00 15.00 B C 

78.905 11.553 1.00 15.00 B C 

79.709 10.722 1.00 15.00 B C 

80.626 9.773 1.00 15.00 B N 

75.711 15.338 1.00 15.00 B C 

74.783 15.019 1.00 61.44 B O 

76.323 16.512 1.00 64.04 B N 

75.943 17.565 1.00 66.10 B C 
76.157 18.935 1.00 65.18 B C 
75.767 19.971 1.00 65.17 B O 
76.742 17.495 1.00 68.11 B C 
77.730 16.771 1.00 68.86 B O 
76.312 18.245 1.00 70.83 B N 

77.030 18.254 1.00 73.31 B C 
76.047 18.267 1.00 72.85 B C 

75.031 17.118 1.00 73.20 B C 
74.175 17.222 1.00 73.51 B C 
75.762 15.790 1.00 73.48 B C 

77.944 19.474 1.00 74.86 B C 
79.098 19.381 1.00 75.81 B O 
77.431 20.616 1.00 76.31 B N 
78.219 21.839 1.00 78.64 fi C 
77.419 23.034 1.00 79.03 B C 
76.283 23..410 1.00 76.93 B C 
78.352 -24.222 1.00 79.18 B C 
78.599 22.152 1.00 80.12 B C 
78.061v 21.556 1.00 80.84 B O 
79.543. 23.069 1.00 15.00 B N 
79.932:: 23V480 1.00 15.00 B C 
80.957^ 24;. 614 1.00 15.00 B C 
82.305. 24.149 1.00 15.00 B C 
82:671 22;. 984 1.00 15.00 B O 
82.998 24.952 1.00 15.00 B 0 
78.718 23.930 1.00 15.00 B C 
77.989 24:. 803 1.00 83.23 B 0 
78.470 23.346 1.00 82.35 B N 
79.159 22.211 1.00 82.04 B C 
77.309 23.756 1.00 82.19 B C 
77.552 23.061 1.00 81.60 B C 
78.157 21.771 1.00 81.26 B C 
77.291 25.275 1.00 82.02 B C 
76.237 25.891 1.00 81.65 B O 
78.479 25.864 1.00 82.36 B N 
78.631 27.307 1.00 83.12 B C 
80.089 27.708 1.00 84.14 B C 
81.045 27.544 1.00 B5.05 B C 
80.576 28.371 1.00 86.12 B C 
81.732 28.739 1.00 87.52 B C 
82.688 29.674 1.00 88.59 B N 
77.728 27.998 1.00 82.97 B C 
77.397 29.175 1.00 83.47 B O 
77.300 27.252 1.00 15.00 B N 
76.484 27.820 1.00 15.00 B C 
77.230 27.750 1.00 15.00 B C 
78.379 28.734 1.00 15.00 B C 
78.246 29.894 1.00 15.00 B O 
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75.149 27.091 1.00 15.00 B C 
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74.182 27.406 1.00 79.45 B N 
72.871 26.768 1.00 78.10 B C 
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73.555 24.265 1.00 73.56 B C 
73.603 23.207 1.00 72.28 B C 
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71.746 27.719 1.00 78.56 B C 
71.996 28.545 1.00 79.49 B 0 
70.625 27.620 1.00 77.72 B O 
66.742 31.537 1.00 70.95 B C 
65.967 32.726 1.00 70.66 B O 
66.508 31.003 1.00 71.45 B C 
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76.517 39.745 1.00 49.83 0 

76.991 37.650 1.00 48.20 N 

75.831 37.698 1.00 48.44 C 

75.565 36.399 1.00 52.80 C 

74.300 36.427 1.00 46.27 0 

75.778 38.712 1.00 51.75 C 

74.622 39.247 1.00 50.48 0 

76.941 39.105 1.00 51.90 N 

76.875 40.165 1.00 52.23 C 

78.162 40.272 1.00 56.15 C 
78.174 40.787 1.00 66.01 C 
77.775 39.652 1.00 70.39 C 
78.875 38.593 1.00 70.21 C 
78.711 37.871 1.00 82.42 N 

76.636 41.594 1.00 53.36 C 
75.984 42.513 1.00 56.96 0 
77.056 41.786 1.00 47.99 N 
76.727 42.963 1.00 45.84 C 
77.413 43.000 1.00 44.84 C 
78.766 42.998 1.00 53.83 0 
75.249 43.044 1.00 45.50 C 
74.612 44.105 1.00 45.01 0 
74.601 41.961 1.00 45.21 N 
73.120 42.082 1.00 43.84 C 
72.714 40.778 1.00 41.31 C 
73.022 40.315 1.00 41.32 C 
72.210 38.955 1.00 41.46 C 
72.393 41.126 1.00 40.27 C 
72.369 42.560 1.00 43.71 C 
71.404 43.187 1.00 44.07 0 
72.933 42.280 1.00 43.30 N 
72.278 42.643 1.00 42.87 C 
72.861 41.817 1.00 40.16 C 

72.637 40.326 1.00 45.23 O 

72.163 42.196 1.00 35.96 C 

72.374 44.128 1.00 41.51 C 

71.375 44.795 1.00 44.31 0 
73.603 44.624 1.00 40.63 N 
73.980 46.013 1.00 34.75 C 
75.423 46.067 1.00 34.48 C 
76.338 45.698 1.00 36.13 C 
75.840 45.500 1.00 39.61 0 
77.663 45.591 1.00 29.29 N 
73.081 46.700 1.00 34.36 C 
72.171 47.498 1.00 34.45 O 
73.155 46.308 1.00 31.97 N 
72.209 46.917 1.00 31.71 C 
72.248 46.353 1.00 33.83 C 
73.523 46.630 1.00 36.13 C 
73.497 45.620 1.00 42.55 C 
73.445 48,027 1.00 38.64 C 
70.811 46.940 1.00 33.08 C 
70.116 47.912 1.00 35.13 0 
70.300 45.886 1.00 36.21 N 
68.850 45.914 1.00 34.71 C 
68.361 44.594 1.00 36.52 C 
69.050 44.229 1.00 30.72 0 
68.596 46.916 1.00 34.96 C 
67.500 47.456 1.00 29.82 0 
69.617 47.187 1.00 35.06 N 
69.385 48.177 1.00 36.79 C 
70.636 48.222 1.00 38.03 C 
70.663 47.768 1.00 43.50 C 
70.488 46.322 1.00 59;74 C 
71.105 45.821 1.00 64.36 C 
72.649 46.483 1.00 58.10 N 
69.212 49.544 1.00 39.23 C 
68.736 50.439 1.00 39.75 0 
69.628 49.754 1.00 39.50 N 
69.271 51.023 1.00 33.52 C 
70.251 51.675 1.00 34.28 C 
71.567 51.107 1.00 36.85 C 
70.112 51-858 1.00 37.83 C 
68.255 50.834 1.00 34.23 C 
67.440 51.739 1.00 30.21 0 
68.092 49.704 1.00 31.83 N 
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Figure 2 
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Figure 2 
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10.055 63.297 2.987 1.00 42.37 C 
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Figure 2 
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Figure 2 
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Figure 2 
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79.378 40.057 1.00 50.23 C 
80.200 40.669 1.00 54.76 C 
79.350 41.379 1.00 60.20 C 
78.048 41.465 1.00 69.56 0 
79.990 41.973 1.00 67.99 N 
79.834 40.577 1.00 50.18 • C 
79.344 41.614 1.00 50.24 0 
80.752 39.834 1.00 48.26 N 
81.227 40.17B 1.00 46.01 C 
82.584 40.640 1.00 47.67 C 
82.674 41.788 1.00 47.42 C 
83.726 41.967 1.00 52.39 C 
83.910 43.096 1.00 59.52 C 
82.990 44.096 1.00 63.37 C 
81.869 43.950 1.00 60.97 C 
81.746 42.745 1.00 57.27 c 
81.289 38.993 1.00 45.88 C 
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85.306 26.809 1.00 45.90 C 

84.974 27.499 1.00 48.27 0 

84.613 25.830 1.00 45.13 N 

83.470 25.728 1.00 47.17 C 

82.718 26.918 1.00 41.72 C 

83.024 27.452 1.00 48.23 C 
82.576 28.709 1.00 49.66 N 
82.991 28.914 1.00 48.56 C 
83.667 27.851 1.00 49.12 N 
83.699 26.932 1.00 54.25 C 
83.750 24.687 1.00 49.39 C 
82.840. 24.010 1.00 52.51 O 
85.007 24.536 1.00 48.87 N 
85.400 23.520 1.00 48.67 C 
86.639 23.935 1.00 49.28 C 
86.308 24.803 1.00 53.14 C 
85.079 24.993 1.00 48.05 0 
87.455 25.324 1.00 50.87 N 
85.708 22.368 1.00 46.50 C 
85.516 21.228 1.00 48.81 0 
86.227 22.646 1.00 45.43 N 
86.460 21.553 1.00 45.64 C 
87.445 22.004 1.00 47.70 C 
88.836 21.221 1.00 53.98 C 
89.910 22.216 1.00 64.66 C 
89.836 23.737 1.00 66.02 C 
90.701 23.913 1.00 69.73 N 
85.100 21.072 1.00 41.47 C 
84.816 19.887 1.00 44.35 0 
84.219 21.978 1.00 34.85 N 
82.967 21.629 1.00 34. 4B C 
82.143 22.977 1.00 31.50 C 
82.732 23.718 1.00 37.06 C 

82.025 24.905 1.00 39.64 C 
82.988 22.791 1.00 41.47 C 
82.351 20.808 1.00 35.00 C 
81.953 19.761 1.00 35.44 0 
82.341 21.327 1.00 38.81 N 
81.739 20.580 1.00 37.07 C 

81.798 21.398 1.00 34.97 C 
80.618 22.398 1.00 37.00 C 

80.657 23.123 1.00 36.25 C 
79.363 22.016 1.00 35.64 C 
82.367 19.204 1.00 36.64 C 
81.756 18.258 1.00 35.79 O 
83.641 19.136 1.00 37.48 N 
84.213 17.900 1.00 39. 5B C 
85.611 18.088 1.00 39.74 C 
86.348 16.944 1.00 52.54 C 
87.926 17.167 1.00 62.82 C 
87.934 17.791 1.00 69.67 C 
88.260 19.482 1.00 68.35 N 

83.799 16.971 1.00 40.86 C 
83.333 15.904 1.00 42.13 0 
83.853 17.396 1.00 43.68 N 
83.459 16.493 1.00 38.89 C 
83.741 17.263 1.00 41.95 C 
85.242 17.335 1.00 43.03 C 
85.667 18.166 1.00 44.01 0 
85.986 16.548 1.00 41.76 N 
82.07B 16.033 1.00 34.42 C 
81.737 14.828 1.00 34.81 O 
81.206 16.893 1.00 31.63 N 
79.852 16.470 1.00 32.14 C 
79.093 17.524 1.00 32.23 C 

77.658 17.208 1.00 35.60 C 
79.140 18.490 1.00 36.37 C 
79.755 15.428 1.00 37.98 C 
78.983 14.314 1.00 42.65 O 
80.543 15.651 1.00 39.20 N 
80.424 14.654 1.00 39.22 C 
81.224 15.084 1.00 41.36 C 
80.887 13.353 1.00 35.38 C 
80.286 12.346 1.00 38.73 0 
81.981 13.340 1.00 34.04 N 
82.527 12.033 1.00 35.60 C 
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gure 2 

10.771 11.271 1.00 31.94 C 

11.158 10.873 1.00 24.52 O 

13.011 12.058 1.00 26.58 C 

13.973 11.652 1.00 24.35 O 

12.878 13.298 1.00 25.40 N 

13.938 14.199 1.00 29.22 C 

13.643 15.668 1.00 31.94 C 

13.455 16.040 1.00 36.62 C 

14.731 15.989 1.00 43.89 C 
14.714 16.477 1.00 32.44 C 
13.860 14.059 1.00 29.06 C 
14.792 14.083 1.00 32.2$ O 
12.738 13.951 1.00 29.76 N 
12.798 13.878 1.00 29.75 C 
11.482 14.255 1.00 28.57 C 
11.386 13.898 1.00 30.13 C 
10.018 14.137 1.00 35.25 C 

9.006 14.448 1.00 41.22 O 

9.929 14.034 1.00 40.48 O 

13.518 12.804 1.00 29.53 C 

14.130 12.911 1.00 29.57 O 

13.574 11.709 I. 00 36.74 N 

14.386 10.511 1.00 37.06 C 

14.288 9.286 1.00 38.80 C 

13.089 8.415 1.00 51.24 C 

12.895 8.064 1.00 58.04 O 

12.252 7.968 1.00 70.15 O 

15.816 10.897 1.00 34.68 C 

16.398 ■ 10.940 1.00 37.09 O 

16.317 11.320 1.00 28.48 N 

17.674 11.917 1.00 25.06 C 

17.749 12.509 1.00 23.78 C 

17.732 11.556 1.00 19.69 C 
17.948 12.216 1.00 26.08 C 
17.219 11.849 1.00 27.90 N 
17.440 10.872 1.00 30.52 C 
18.398 10.079 1.00 40.81 N 
16.703 10.701 1.00 38.85 N 
17.974, 12.860 1.00 25.85 C 
18.853 12.617 -1.00 28.39 O 
17.142 13.873 -1.00 23.15 N 
17.578 14.665 1.00 19.32 C 
16.785 15.890 1.00 18.87 C 
17.153 16.710 1.00 16.20 C 
16.969 16.622 1.00 19.56 C 
17.557 13.982 1.00 25.68 C 
18.494 14.221 1.00 30.26 O 

16.603 13.073 1.00 27.14 N 
16.687 12.285 1.00 26.50 C 
15.543 11.406 1.00 26.46 C 
14.142 12.227 1.00 32.82 C 
12.986 11.466 1.00 36.97 C 

12.604 11.480 1.00 34.49 O 
12.393 10.724 1.00 40.06 N 
17.834 11.404 1.00 27.30 C 
18.409 11.235 1.00 27.70 O 
18.277 10.871 1.00 28.08 N 
19.504 10.068 1.00 28.92 C 
19.755 9.273 1.00 29.57 C 
20.986 8.436 1.00 28.44 C 
21.537 7.892 1.00 38.61 C 
21.228 8.450 1.00 40.53 O 
22.379 6.910 1.00 42.19 O 
20.730 10.924 1.00 30.33 C 
21.482 10.579 1.00 32.59 O 
20.913 12.036 1.00 24.62 N 
22.027 12.915 1.00 26.05 C 
22.105 14.102 1.00 25.13 C 
23.402 14.770 1.00 29.49 C 
24.516 13.861 1.00 35.44 C 
24.176 13.104 1.00 38.52 O 
25.626 13.868 1.00 26.16 O 
21.903 13.357 1.00 25.34 C 
22.747 13.567 1.00 27.01 O 
20.754 13.495 1.00 29.08 N 
20.613 13.969 1.00 31.51 C 



SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



159/514 



PCT/GB02/04872 



ATOM 


4634 


CB 


ALA B 149 


ATOM 


4635 


C 


ALA B 149 


ATOM 


4636 


0 


ALA B 149 


ATOM 


4637 


N 


ARG B 150 


ATOM 


4638 


CA 


ARG B 150 


ATOM 


4639 


CB 


ARG B 150 


ATOM 


4640 


CG 


ARG B 150 


ATOM 


4641 


CD 


ARG B 150 


ATOM 


4642 


NE 


ARG B 150 


ATOM 


4643 


CZ 


ARG B 150 


ATOM 


4644 


MU 


. ARG B 150 


ATOM 


4645 


NH2 


! ARG B 150 


ATOM 


4646 


C 


ARG B 150 


ATOM 


4647 


0 


ARG B 150 


ATOM 


4646 


N 


CYS B 151 


ATOM 


4649 


CA 


CYS B 151 


ATOM 


4650 


CB 


CYS B 151 


ATOM 


4651 


SG 


CYS B 151 


ATOM 


4652 


C 


CYS B 151 


ATOM 


4653 


O 


CYS B 151 


ATOM 


4654 


N 


LEU B 152 


ATOM 


4655 


CA 


LEU B 152 


ATOM 


4656 


CB 


LEU B 152 


ATOM 


4657 


CG 


LEU B 152 


ATOM 


4658 


CD1 


LEU B 152 


ATOM 


4659 


CD2 


LEU B 152 


ATOM 


4660 


C 


LEU B 152 


ATOM 


4661 


O 


LEU B 152 


ATOM 


4662 


N 


VAL B 153 


ATOM 


4663 


CA 


VAL B 153 


ATOM 


4664 


CB 


VAL B 153 


ATOM 


4665 


CGI 


VAL B 153 


ATOM 


4666 


CG2 


VAL B 153 


ATOM 


4667 


C 


VAL B 153 


ATOM 


4668 


O 


VAL B 153 


ATOM 


4669 


N 


GLU B 154 


- ATOM 


4670 


CA 


GLU B 154 


ATOM 


4671 


CB 


GLU B 154 


ATOM 


4672 


CG 


GLU B 154 


ATOM 


4673 


CD 


GLU B 154 


ATOM 


4674 


OE1 


GLU B 154 


ATOM 


4675 


OE2 


GLU B 154 


ATOM 


4676 


C 


GLU B 154 


ATOM 


4 677 


O 


GLU B 154 


ATOM 


4678 


N 


GLU B 155 


ATOM 


4679 


CA 


GLU B 155 


ATOM 


4680 


CB 


GLU B 155 


ATOM 


4681 


CG 


GLU B 155 


ATOM 


4682 


CD 


GLU B 155 


ATOM 


4683 


OEl 


GLU B 155 


ATOM 


4684 


OE2 


GLU B 155 


ATOM 


4685 


C 


GLU B 155 


ATOM 


4686 


O 


GLU B 155 


ATOM 


4687 


N 


LEU B 156 


ATOM 


4688 


CA 


LEU B 156 


ATOM 


4689 


CB 


LEU B 156 


ATOM 


4690 


CG 


LEU B 156 


ATOM 


4691 


CD1 


LEU B 156 


ATOM 


4692 


CD2 


LEU B 156 


ATOM 


4693 


C 


LEU B 156 


ATOM 


4694 


O 


LEU B 156 


ATOM 


4695 


N 


ARG B 157 


ATOM 


4696 


CA 


ARG B 157 


ATOM 


4697 


CB 


ARG B 157 


ATOM 


4698 


CG 


ARG B 157 


ATOM 


4699 


CD 


ARG B 157 


ATOM 


4700 


NE 


ARG B 157 


ATOM 


4701 


CZ 


ARG B 157 


ATOM 


4702 


NH1 


ARG B 157 


ATOM 


4703 


NH2 


ARG B 157 


ATOM 


4704 


C 


ARG B 157 


ATOM 


4705 


O 


ARG B 157 


ATOM 


4706 


N 


LYS B 158 


ATOM 


4707 


CA 


LYS B 158 


ATOM 


4708 


CB 


LYS B 158 


ATOM 


4709 


CG 


LYS B 158 



Figure 2 
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Figure 2 

100.363 32.372 9.921 

99.754 32.059 8.496 

98.626 33.123 8.183 

104.411 33.485 12.572 

104.855 34.529 12.494 

104.382 32.973 13.694 

105.060 33.756 14.693 

104.967 33.023 15.990 

105.495 31.714 15.813 

103.494 32.682 16.412 

106.456 33.937 14.425 

107.102 34.483 15.222 

106.873 33.437 13.312 

108.178 33.663 12.887 

108.165 35.089 12.428 
107.487 35.389 U.074 
108.031 36.824 10.417 
109.474 36.651 9.757 
110.208 38.075 9.343 
109.306 33.385 13.918 
110.492 33.589 13.658 
109.008 32.732 14.996 
110.041 32.275 15.888 
111.136 31.860 15.274 
110.384 33.228 16.921 
111.321 33.055 17.542 
109.553 34.207 17.168 
109.778 35.X59 18.140 
109.476 36.517 17.510 
108.060 36.686 17.272 
108.861 35.031 19.270 
107.866 34.662 19.065 
109.182 35.529 20.438 
108.545 35.319 21.627 
109.089 36.435 22.427 
110.376 36.617 22.010 
110.252 36.485 20.643 
107.225 35.565 21.396 
106.896 36.357 20.603 
106.422 34.797 22.137 
105.007 34.781 22.018 
104.720 33.986 20.8S7 
103.455 32.773 20.911 
104.202 34.267 23.135 
104.678 33.612 23.793 
102.969 34.709 23.313 
102.028 34.381 24.411 
101.323 35.597 25.057 
100.163 35.239 25.971 

99.438 36.096 26.776 

99.866 33.956 26.168 

101.036 33.495 23.675 

100.155 33.946 22.790 

101.171 32.188 23.923 

100.400 31.141 23.213 

100.991 29.937 23.779 

101.166 30.271 25.253 
101.955 31.616 24.988 

98.970 31.259 23.623 

98.102 30.752 23.092 

98.648 32.034 24.538 

97.178 31.984 24.827 

96.796 33.059 25.641 

97.642 33.237 26.789 

95.619 32.880 26.098 

96.181 32.065 23.807 

95.355 31.351 23.751 

96.163 32.960 22.866 

95.131 33.179 21.969 

95.403 34.531 21.286 

94.487 34.848 20.181 

94.927 34.834 18.965 

93.999 35.071 18.011 

92.666 35.355 18.377 

92.308 35.395 19.495 

93.150 35.151 20.451 



1.00 61.11 


C 


1.00 71.94 


C 


1.00 81.39 


N 


1.00 54.51 


C 


1.00 61.71 


0 


1.00 53.68 


N 


1.00 52.09 


C 


1.00 52.39 


C 


1.00 58.41 


0 


1.00 46.11 


C 


1.00 49.47 


c 


1.00 47.60 


0 


1.00 51.05 


N 


1.00 52.48 


c 


1.00 56.02 


c 


1.00 63.34 


c 


1.00 71.08 


c 


1.00 79.11 


c 


1.00 85.01 


N 


1.00 50.75 


C 


1.00 49.62 


0 


1.00 48.83 


N 


1.00 47.64 


C 


1.00 45.23 


C 


1.00 49.53 


C 


1.00 52.07 


0 


1.00 50.59 


N 


1.00 50.86 


C 


1.00 52.96 


C 


1.00 58.20 


0 


1.00 49.63 


c 


1.00 51.04 


0 


1.00 48.65 


N 


1.00 46.81 


c 


1.00 49.46 


c 


1.00 46.20 


c 


1.00 48.41 


c 


1.00 46.95 


c 


1.00 49.30 


0 


1.00 49.04 


N 


1.00 44.62 


c 


1.00 43.60 


c 


1.00 38.18 


s 


1.00 40.76 


c 


1.00 39.64 


0 


1.00 37.54 


N 


1.00 36.63 


C 


1.00 31.80 


C 


1.00 39.27 


C 


1.00 38.76 


0 


1.00 44.60 


0 


1.00 38.59 


c 


1.00 40.57 


0 


1.00 35.97 


N 


1.00 31.62 


C 


1.00 36.05 


C 


1.00 34.73 


C 


1.00 36.92 


C 


1.00 24.78 


C 


1.00 24.22 


0 


1.00 21.08 


N 


1.00 19.90 


C 


1.00 18.63 


C 


1.00 23.25 


0 


1.00 22.87 


C 


1.00 26.13 


C 


1.00 37.62 


0 


1.00 30.70 


N 


1.00 28.79 


C 


1.00 31.91 


C 


1.00 32.40 


C 


1.00 34.00 


C 


1.00 35.99 


C 


1.00 24.97 


c 


1.00 26.27 


c 


1.00 30.65 


c 



SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 

161/514 



Figure 2 



ATOM 


4786 


C 


PHE 


B 


168 


95.033 


32.160 


20.929 


ATOM 


4787 


O 


PHE 


B 


168 


93.895 


31.730 


20.620 


ATOM 


4788 


N 


ILE 


B 


169 


96.183 


31.932 


20.313 


ATOM 


4789 


CA 


ILE 


B 


169 


96.340 


30.976 


19.272 


ATOM 


4790 


CB 


ILE 


B 


169 


97.707 


30.841 


19.013 


ATOM 


4791 


CGI 


ILE 


B 


169 


98.132 


31.801 


17.947 


ATOM 


4792 


CD1 


ILE 


B 


169 


99.829 


31.655 


17.390 


ATOM 


4793 


CG2 


ILE 


B 


169 


97.922 


29.568 


18.418 


ATOM 


4794 


C 


ILE 


B 


169 


95.916 


29.571 


19.848 


ATOM 


4795 


O 


ILE 


B 


169 


95.164 


28.871 


19.204 


ATOM 


4796 


N 


LEU 


B 


170 


96.343 


29.186 


21.032 


ATOM 


4797 


CA 


LEU 


B 


170 


95.983 


27.913 


21.565 


ATOM 


4798 


CB 


LEU 


B 


170 


96.727 


27.662 


22.752 


ATOM 


4799 


CG 


LEU 


B 


170 


97.814 


26.702 


22.726 


ATOM 


4800 


CD1 


LEU 


B 


170 


98.158 


26.516 


21.360 


ATOM 


4801 


CD2 


LEU 


B 


170 


99.047 


27.075 


23.534 


ATOM 


4802 


C 


LEU 


B 


170 


94.578 


2B.060 


21.983 


ATOM 


4803 


O 


LEU 


B 


170 


93.966 


27.141 


22.357 


ATOM 


4804 


N 


GLY 


B 


171 


93.966 


29.215 


22.055 


ATOM 


4805 


CA 


GLY 


B 


171 


92.529 


29,096 


22.379 


ATOM 


4806 


C 


GLY 


B 


171 


91.746 


28.922 


21.057 


ATOM 


4807 


O 


GLY 


B 


171 


90.466 


28.595 


20.822 


ATOM 


4808 


N 


CYS 


B 


172 


92.445 


29.246 


20.048 


ATOM 


4809 


CA 


CYS 


B 


172 


91.573 


29.370 


18.809 


ATOM 


4810 


CB 


CYS 


B 


172 


92.459 


30.132 


17.702 


ATOM 


4811 


SG 


CYS 


B 


172 


92.228 


31.865 


17.740 


ATOM 


4812 


C 


CYS 


B 


172 


91.256 


27.977 


18.251 


ATOM 


4813 


O 


CYS 


B 


172 


90.246 


27.725 


17.837 


ATOM 


4814 


N 


ALA 


B 


173 


92.248 


27.099 


18.380 


ATOM 


4815 


CA 


ALA 


B 


173 


92.172 


25.711 


17.952 


ATOM 


4816 - 


CB • 


ALA 


B 


173 


93.542 


25.129 


18.334 


ATOM 


4817 


C 


ALA B 173 


90.998 


24.919 


18.474 


ATOM 


4818 


o 


ALA 


B 


173 


90.165 


24.459 


17.751 


ATOM 


4819 


N 


PRO 


B 


174 


90.869 


24.858 


19.740 


ATOM 


4820 . 


CA 


PRO B'174 


89.810 


24.181 


20.312 


ATOM 


4821- 


CB 


PRO 


B 


174 


90.139 


24.114 


21.780 


ATOM 


4822 . 


CG . 


PRO 


B 174 


91.447 


24.724 


21.935 


ATOM 


4823 


CD 


PRO 


B 


174 


91.641 


25.559 


20.671 


ATOM 


4824- 


C 


PRO 


B 


174 


88.511 


24.829 


20.006 


ATOM 


4825. 


O ^ 


PRO 


B 


174 


87.456 


24.215 


19.800 


ATOM 


4826 


N 


CYS 


B 


175 


88.487 


26.087 


19.889 


ATOM 


4827 


CA 


CYS 


B 


175 


87.160 


26.704 


19.533 


ATOM 


4828 


CB 


CYS 


B 


175 


87.252 


28.198 


19.567 


ATOM 


4829 


SG 


CYS 


B 


175 


85.647 


29.010 


19.387 


ATOM 


4830 


C 


CYS 


B 


175 


86.717 


26.335 


18.179 


ATOM 


4831 


O 


CYS 


B 


175 


85.448 


26.101 


17.825 


ATOM 


4832 


N 


ASN 


B 


176 


87.648 


26.343 


17.340 


ATOM 


4833 


CA 


ASN 


B 


176 


87.337 


25.979 


15.898 


ATOM 


4834 


CB 


ASN 


B 


176 


88.583 


26.231 


15.084 


ATOM 


4835 


CG 


ASN 


B 


176 


88.384 


27.164 


13.862 


ATOM 


4836 


OD1 'ASN 


B 


176 


87.273 


27.552 


13.538 


ATOM 


4837 


ND2 


ASN 


B 


176 


89.567 


27.321 


13.046 


ATOM 


4838 


C 


ASN 


B 


176 


86.927 


24.479 


15.783 


ATOM 


4839 


O 


ASN 


B 


176 


86.092 


24.080 


14.996 


ATOM 


4840 


N 


VAL 


B 


177 


87.486 


23.658 


16.577 


ATOM 


4841 


CA 


VAL 


B 


177 


87.099 


22.283 


16.503" 


ATOM 


4842 


CB 


VAL 


B 


177 


87.785 


21.431 


17.459 


ATOM 


• 4843 


CGI 


VAL 


B 


177 


87.062 


20.155 


17.601 


ATOM 


4844 


CG2 


VAL 


B 


177 


89.136 


21.092 


17.060 


ATOM 


4845 


C 


VAL 


B 


177 


85.705 


22.271 


16.790 


ATOM 


4846 


O 


VAL 


B 


177 


84.926 


21.693 


16.100 


ATOM 


4847 


N 


ILE 


B 


178 


85.264 


22.931 


17.841 


ATOM 


4848 


CA 


ILE 


B 


178 


83.713 


22.924 


18.113 


ATOM 


4849 


CB 


ILE 


B 


178 


83.352 


23.588 


19.385 


ATOM 


4850 


CGI 


ILE 


B 


178 


83.385 


22.683 


20.600 


ATOM 


4851 


CD1 


ILE 


B 


178 


84.643 


22.611 


20.791 


ATOM 


4852 


CG2 


ILE 


B 


178 


81.862 


23.809 


19.280 


ATOM 


4853 


C 


ILE 


B 


178 


83.019 


23.496 


16.978 


ATOM 


4854 


0 


ILE 


B 


178 


81.986 


23.064 


16.541 


ATOM 


4655 


N 


CYS 


B 


179 


83.592 


24.555 


16.358 


ATOM 


4656 


CA 


CYS 


B 


179 


82.884 


25.158 


15.108 


ATOM 


4B57 


CB 


CYS 


B 


179 


83.619 


26.319 


14.513 


ATOM 


4858 


SG 


CYS 


B 


179 . 


83.641 


27.812 


15.597 


ATOM 


4859 


C 


CYS 


B 


179 


82.563 


24.155 


14.001 


ATOM 


4860 


0 


CYS 


B 


179 


81.365 


24.065 


13.326 


ATOM 


4861 


N 


SER 


B 


180 


83.574 


23.371 


13.821 



PCT/GB02/04872 



1.00 29.57 c 

1.00 27.47 0 

1.00 29.02 N 

1.00 27.44 C 

1.00 27.00 C 

1.00 31.87 C 

1.00 27.35 C 

1.00 36.89 C 

1.00 28.15 C 

1.00 29.47 O 

1.00 25.54 N 

1.00 22.81 C 

1.00 24.41 C 

1.00 28.75 C 

1.00 29.89 C 

1.00 31.76 C 

1.00 25.41 C 

1.00 26.24 O 

1.00 26.86 N 

1.00 26.29 C 

1.00 27.95 C 

1.00 25.22 0 

1.00 29.75 N 

1.00 34.69 C 

1.00 38.03 C 

1.00 40.36 S 

1.00 32.91 C 

1.00 32.86 O 

1.00 30.46 N 

1.00 28.93 C 

1.00 31.28 C 

1.00 25.46 C 

1.00 25.36 0 ' rCy -fCA 

1.00 23.81 N ' K,v.V7 : : 

1.00 24.54 C '-■>'■•> 

1.00 21.43 C : ' - > f- 'O 

1.00 26.69 C . ■ .2 V!K 

1.00 25.01 C :*rj 

1.00 26.26 C 

1.00 27.84 O - 

1.00 30.09 N 

1.00 31.67 C 

1.00 35.10 C 

1.00 41.52 S 

1.00 28.66 C 

1.00 29.08 O 

1.00 26.22 N 

1.00 27.87 C 

1.00 30.58 C 

1.00 32.62 C 

1.00 32.59 0 

1.00 28.60 N 

1.00 25.44 C 

1.00 26.43 O 

1.00 25.03 N 

1.00 23.19 C 

1.00 22.29 C 

1.00 31.48 C 

1.00 20.77 C 

1.00 23.32 C 

1.00 22.25 0 

1.00 27.73 N 

1.00 23.65 C 

1.00 23.56 C 

1.00 27.91 C 

1.00 39.44 C 

1.00 26.78 C 

1.00 22.88 C 

1.00 25.28 0 

1.00 28.73 N 

1.00 28.34 C 

1.00 27.25 C 

1.00 39.46 S 

1.00 26.94 C 

1.00 27.96 0 

1.00 26.66 N 



SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



162/514 



PCT/GB02/04872 



Figure 2 



ATOM 


4862 


CA 


SER 


B 


180 


83.554 


22.343 


12.785 


1.00 


25.98 


C 


ATOM 


4863 


CB 


SER 


B 


180 


84.B76 


21.706 


12.713 


1.00 


24.43 


C 


ATOM 


4664 


OG 


SER 


B 


160 


84.887 


21.117 


11.399 


1.00 


32.83 


0 


ATOM 


4865 


C 


SER 


B 


180 


82.547 


21.396 


13.166 


1.00 


29.47 


C 


ATOM 


4866 


O 


SER 


B 


180 


81.666 


21.163 


12.415 


1.00 


36.80 


0 


ATOM 


4867 


N 


ILE 


B 


181 


82.535 


20.809 


14,358 


1.00 


28.88 


N 


ATOM 


4868 


CA 


ILE 


B 


181 


81.544 


19.866 


14.679 


1.00 


25.12 


C 


ATOM 


4869 


CB 


ILE 


B 


181 


81.600 


19.426 


16.036 


1.00 


26.75 


C 


ATOM 


4870 


CGI 


ILE 


B 


181 


82.979 


16.451 


16.033 


1.00 


26.17 


C 


ATOM 


4871 


CD1 


ILE 


B 


181 


83.642 


18.396 


17.377 


1.00 


30.37 


c 


ATOM 


4872 


CG2 


ILE 


B 


181 


80.692 


18.673 


16.527 


1.00 


31.53 


c 


ATOM 


4873 


C 


ILE 


B 


181 


80.168 


20.461 


14.614 


l.OO 


29.49 


c 


ATOM 


4874 


O 


ILE 


B 


181 


79.166 


19.696 


14.333 


1.00 


27.23 


0 


ATOM 


4875 


N 


ILE 


B 


182 


79.973 


21.776 


14.971 


1.00 


33.16 


N 


ATOM 


4876 


CA 


ILE 


B 


182 


78.512 


22.252 


14.987 


1.00 


29.70 


C 


ATOM 


4877 


CB 


ILE 


B 


182 


78.225 


23.272 


15.934 


1,00 


30.25 


C 


ATOM 


4878 


CGI 


ILE 


B 


182 


78.932 


23.134 


17.335 


1.00 


30.01 


C 


ATOM 


4879 


CD1 


ILE 


B 


182 


78.171 


22.620. 


18.233 


1.00 


31.35 


c 


ATOM 


4880 


CG2 


ILE 


B 


182 


76.862 


23.502 


15.947 


1.00 


29.01 


c 


ATOM 


4881 


C 


ILE 


B 


182 


78.094 


22.769 


13.627 


1.00 


32.50 


c 


ATOM 


4882 


O 


ILE 


B 


182 


76.959 


22.440 


13.168 


1.00 


31.00 


0 


ATOM 


4883 


N 


PHE 


B 


163 


78.944 


23.571 


12.959 


1.00 


31.69 


N 


ATOM 


4884 


CA 


PHE 


B 


183 


78.528 


24.210 


11.731 


1.00 


32.77 


C 


ATOM 


4885 


CB 


PHE 


B 


1B3 


79.172 


25.707 


11.642 


1.00 


34.05 


C 


ATOM 


4886 


CG 


PHE 


B 


183 


78.961 


26.564 


12.793 


1.00 


29.87 


C 


ATOM 


4887 


CD1 


PHE 


B 


183 


79.881 


27.371 


13.310 


1.00 


30.53 


C 


ATOM 


4888 


CE1 


PHE 


B 


163 


79.626 


28.070 


14.442 


1.00 


37.12 


C 


ATOM 


4869 


CZ 


PHE 


B 


183 


78.343 


27.955 


15.055 


1.00 


36.76 


c 


ATOM 


4890 


CE2 


PHE 


B 


183 


77.411 


27.170 


14.523 


1.00 


35.12 


c 


ATOM 


4891 


CD2 


PHE 


B 


183 


77.704 


26.490 


13.406 


1.00 


37.30 


c 


ATOM 


4892 


C 


PHE 


B 


183 


78.977 


23.522 


10.435 


1.00 


35.81 


c 


ATOM 


4893 


O 


PHE 


B 


183 


78.553 


23.868 


9.401 


1.00 


40.32 


0 


ATOM 


4894 


N 


HIS 


B 


184 


79.918 


22.679 


10.489 


1.00 


39.93 


N 


ATOM 


4895 


CA 


HIS 


B 


184 


80.539 


21.838 


9.507 


1.00 


46.66 


C 


ATOM 


4896 


CB 


HIS 


B 


184 


79.753 


21.456 


8.303 


1.00 


50.11 


C 


ATOM 


4897 


CG 


HIS 


B 


184 


80.318 


20.269 


7.565 


1.00 


54.75 


C 


ATOM 


4898 


ND1 


HIS 


B 


184 


80.415 


20.246 


6.180 


1.00 


61.14 


N 


ATOM 


4899 


CEl 


HIS 


B 


184 


80.901 


19.060 


5.806 


1.00 


68.11 


C 


ATOM 


4900 


NE2 


HIS 


B 


184 


81.050 


18.291 


6.898 


1.00 


71.29 


N 


ATOM 


4901 


CD2 


HIS 


B 


184 


80.692 


19.017 


8.013 


1.00 


59.57 


C 


ATOM 


4902 


C 


HIS 


B 


184 


81.829 


22.169 


9.092 


1.00 


49.83 


C 


ATOM 


4903 


O 


HIS 


B 


184 


82.592 


21.295 


8.762 


1.00 


57.32 


0 


ATOM 


4904 


H 


LYS 


B 


185 


82.114 


23.436 


9.117 


1.00 


49.19 


N 


ATOM 


4905 


CA 


LYS 


B 


185 


83.380 


23.855 


8.773 


1.00 


46.16 


c 


ATOM 


4906 


CB 


LYS 


B 


185 


82.954 


24.543 


7.481 


1.00 


52.96 


c 


ATOM 


4907 


CG 


LYS 


B 


165 


83.612 


25.967 


7.138 


1.00 


59.65 


c 


ATOM 


4908 


CD 


LYS 


B 


185 


62.778 


26.639 


5.936 


1.00 


66.28 


c 


ATOM 


4909 


CE 


LYS 


B 


185 


83.602 


28.052 


5.568 


1.00 


72 .,61 


c 


ATOM 


4910 


HZ 


LYS 


B 


185 


83.876 


29.106 ' 


6.786 


1.00 


56.39 


N 


ATOM 


4911 


C 


LYS 


B 


185 


84.008 


24.826 


9.754 


1.00 


41.07 


c 


ATOM 


4912 


0 


LYS 


B 


185 


83.402 


25.694 


10.278 


1.00 


43.29 


0 


ATOM 


4913 


N 


ARG 


B 


186 


65.265 


24.751 


9.918 


1.00 


37.63 


N 


ATOM 


4914 


CA 


ARG 


B 


186 


86.025 


25.682 


10.690 


1.00 


35.62 


c 


ATOM 


4915 


CB 


ARG 


B 


186 


87.417 


25.226 


10.781 


1.00 


33.61 


C 


ATOM 


4916 


CG 


ARG 


B 


186 


86.041 


25.065 


9.601 


1.00 


29.63 


c 


ATOM 


4917 


CD 


ARG 


B 


166 


89.425 


24.576 


9.537 


1.00 


26.57 


c 


ATOM 


4918 


NE 


ARG 


B 


186 


90.297 


25.092 


10.546 


1.00 


39.46 


N 


ATOM 


4919 


CZ 


ARG 


B 


186 


91.107 


26.192 


10.472 


1.00 


47.38 


C 


ATOM 


4920 


NH1 


ARG 


B 


186 


91.071 


26.904 


9.429 


1.00 


37.49 


N 


ATOM 


4921 


NH2 


ARG 


B 


186 


91.942 


26.575 


11.520 


1.00 


48.25 


N 


ATOM 


4922 


C 


ARG 


B 


186 


86.141 


27.066 


10.035 


1.00 


39.08 


C 


ATOM 


4923 


O 


ARG 


B 


186 


85.825 


27.163 


8.867 


1.00 


38.74 


0 


ATOM 


4924 


N 


PHE 


B 


187 


86.553 


28.066 


10.799 


1.00 


39.88 


N 


ATOM 


4925 


CA 


PHE 


B 


187 


86.684 


29.383 


10.390 


1.00 


41.05 


C 


ATOM 


4926 


CB 


PHE 


B 


187 


86.225 


30.372 


11.376 


1.00 


42.60 


c 


ATOM 


4927 


CG 


PHE 


B 


187 


84.699 


30.327 


11.530 


1.00 


39.62 


c 


ATOM 


4928 


CD1 


PHE 


B 


187 


84.126 


29.770 


12.666 


1.00 


36.18 


c 


ATOM 


4929 


CE1 


PHE 


B 


187 


82.774 


29.661 


12.854 


1.00 


37.51 


c 


ATOM 


4930 


CZ 


PHE 


B 


187 


81.943 


30.130 


11.904 


1.00 


41.90 


c 


ATOM 


4931 


CE2 


PHE 


B 


187 


82.545 


30.731 


10.728 


1.00 


42.53 


c 


ATOM 


4932 


CD2 


PHE 


B 


187 


83.944 


30.794 


10.601 


1.00 


37.81 


c 


ATOM 


4933 


C 


PHE 


B 


187 


88.069 


29.555 


10.254 


1.00 


44.70 


c 


ATOM 


4934 


O 


PHE 


B 


187 


88.755 


28.653 


10.438 


1.00 


49.78 


0 


ATOM 


4935 


N 


ASP 


B 


188 


88.519 


30.686 


9.834 


1.00 


47.01 


N 


ATOM 


4936 


CA 


ASP 


B 


188 


89.940 


30.985 


9.574 


1.00 


49.09 


c 


ATOM 


4937 


CB 


ASP 


B 


188 


90.067 


31.854 


8.206 


1.00 


51.42 


c 



SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



163/514 



PCT/GB02/04872 



Figure 2 
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gure 2 ... 
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Figure 2 
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Figure 3 
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Figure 3 
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16.890 63.649 

17.783 63.331 

16.077 62.763 

16.231 61.345 

17.342 60.802 

14.945 60.579 

13.980 61.128 

14.924 59.307 

13.728 58.524 

12.766 58.418 

13.070 58.848 

11.591 57.846 

10.598 57.680 

9.466 56.763 

9.797 55.290 

8.635 54.376 

8.808 . 52.981 

8.801 53.012 

10.051 59.005 

9.421 59.049 

10.329 60.089 

9.868 61.421 

9.123 62.077 

7.616 61.800 

6.810 62.766 

5.380 62.469 

4.407 63.334 

4.693 64.579 

, 3.137 62.943 

' 11.005 62.326 

10.838 63.539 

•12.150 61.723 

13.320 62.458 

.14.422 61.516 

14.006 60.784 

15.682 62.292 

16.828 61.381 

.12.997 63.278 
V 12.187 ' 62.866 

13.674 64.408 

13.422 65.265 
14.459 66.361 
14.341 67,079 
13.349 64.578 
14.283 63.911 
12.236 64.815 
12.012 64.273 
10.627 64.638 
10.553 64.585 

9.645 63.676 

13.057 64.766 

13.543 64.014 

13.428 66.030 

14.423 66.558 
15.867 66.452 
16.689 67.309 
16.201 65.423 
17.587 65.279 
17.769 63.985 
19.145 63.876 
19.315 62.585 
20.345 62.454 
18.423 61.689 
18.582 65.285 
19.570 66.011 
18.302 64.469 
19.169 64.364 
18.653 63.291 
19.282 65.676 
20.366 66.245 
18.149 66.146 
18.102 67.390 
16.662 67.941 
16.448 69.207 
17.281 69.384 



3 



26.437 1.00 24.07 

25.653 1.00 26.36 

26.993 1.00 26.27 

26,727 1.00 29.60 

27.589 1.00 28.86 

26.989 1.00 32.25 

27.512 1.00 34.62 

26.618 1.00 34.16 

26.864 1.00 37.75 

25.698 1.00 39.79 

24.582 1.00 39.21 
25.956 1.00 41.76 
24.901 1.00 43.17 
25.382 1.00 44.81 
25.178 1.00 48.61 
25.532 1.00 51.62 
24.909 1.00 53.39 
23.400 1.00 54.40 
24.366 1.00 42.71 
23.307 1.00 42.71 
25.088 1.00 42.78 
24.693 1.00 41.46 
25.871 1.00 43.17 
25.907 1.00 45.43 
25.030 1.00 46.80 
25.078 1.00 48.92 
24.787 1.00 51.22 
24.422 1.00 50.24 
24.849 1,00 52.87 
24.200 1.00 39.70 
24.097 1.00 40.07 
23.891 1.00 36.43 
23.406 1.00 33.74 
22.986 1.00 34.82 
21.725 1.00 35.49 
22.657 1.00 34.75 
22.328 1.00 35.41 
22.169 ' 1.00 31.17 
21.351 1.00 31.21 
22.011 1.00 27.74 
20.666 1.00 26.20 
20.713 1.00 25.71 
19.062 1.00 29.11 
19.523 1.00 25.59 
19.088 1.00 26.16 
18.847 1.00 25.30 
17.529 1.00 24.68 
17.046 1.00 22.85 
15.551 1.00 22.21 
17.646 1.00 23.03 
16.539 1.00 26. 7B 

15.699 1.00 28.82 
16.645 1.00 27.44 
15.734 1.00 29.21 
16.187 1.00 30,37 
15,827 1.00 31.48 
16.964 1.00 30.25 
17.424 1.00 29.95 
18.224 1.00 32.05 
18.906 1.00 38.82 
19.688 1,00 42.64 
20.413 1.00 43.98 

19.583 1.00 43.52 
16.261 1.00 27.57 
16.279 1.00 25.24 
15.252 1.00 27.48 
14.095 1.00 26.05 
13.165 1.00 27.14 
13.346 1.00 26.34 
13.220 1.00 27.58 
12.834 1.00 26.54 
12.068 1.00 25.44 
11.867 1.00 24.53 
11.230 1.00 23.71 

9.977 1.00 25.27 
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Figure 3 

71.236 29.973 53.983 1.00 56.55 

71.160 31.371 54.616 1.00 57.86 

69.785 31.954 54.331 1.00 59.29 

71.407 31.297 56.117 1.00 57.96 

72.358 27.907 53.180 1.00 54.17 

71.497 27.107 53.551 1.00 54.26 

73.157 27.694 52.136 1.00 54.12 

73.058 26.475 51.346 1.00 53.55 

73.388 26.766 49.873* 1.00 55.94 

72.817 25.739 48.902 1.00 58.48 

72.969 26.166 47.439 1.00 60.2? 

72.363 25.119 46.502 1.00 61.43 

70.941 24.786 46.868 1.00 61.31 

73.992 25.407 51.902 1.00 52.23 

75.189 25.631 52.052 1.00 52.92 

73.438 24.240 52.258 l.OO 51.04 

71.997 23.959 52.411 1.00 51.22 
74.239 23.144 52.800 1.00 49.79 
73.225 22.367 53.629 1.00 49.81 
71.988 22.491 52.801 1.00 50.73 
74.807 22.325 51.649 1.00 48.08 
74.114 22.060 50.667 1.00 48.26 
76.071 21.937 51.769 1.00 45.88 
76.738 21.153 50.735 1.00 44.14 
77.890 21.949 50.080 1.00 44.72 
78.558 21.108 49.012 1.00 45.82 
77.363 23.249 49.478 1.00 44.97 
76.429 23.064 48.325 1.00 46.05 
77.335. 19.890 51.337 1.00 41.82 
77.712 19.873 52.504 1.00 42.17 
77.412' 18.-835 50.536 1.00 39.71 

77.998 17.579 50.982 1.00 38.49 
77.025 16.408 50.796 1.00 37.65 
77.523 15.185 51.538 1.00 38.54 
75.660 - 16.805 51.282 1.00 38.02 
79.229' 17.367 50.105 1.00 37.21 
79.122 17.244 48.883 1.00 38.14 
80.396. 17.361 .50.742 1.00 35.14 
B1.673 171203 50.052 1.00 34.54 
82.748 18.110 50.700 1.00 34.09 
84.040 18.085 49.905 1.00 33.51 
82.216 19.507 50.834 1.00 34.58 
82.158 15.753 50.138 1.00 33.28 
82.262 15.199 51.224 1.00 35.03 
82.432 15.130 49.001 1.00 30.43 
82.933 13.770 49.013 1.00 28.19 
82.283 12.955 47.915 1.00 27.08 
80.774 13.112 47.901 1.00 27.37 
80.204 12.284 46.776 1.00 28.13 
80.174 12.694 49.228 1.00 28.04 
84.425 13.894 48.774 1.00 28.28 
84.847 14.443 47.759 1.00 28.32 
85.233 13.382 49.692 1.00 30.59 
86.655 13.546 49.519 1.00 32.31 
87.293 13.987 50.818 1.00 33.23 
88.596 14.684 50.616 1-00 34.53 
88.880 15.861 50.007 1.00 37.11 
89.812 14.136 50.971 1.00 36.57 
90.787 14.942 50.582 1.00 37.86 
90.248 15.995 49.993 1.00 38.97 
87.489 12.429 48.950 1.00 34.95 
87.884 12.472 47.792 1.00 36.29 
87.794 11.431 49.766 1.00 36.88 
88.636 10.342 49.291 1.00 38.70 
88.080 9.580 48.106 1.00 39.18 
86.864 9.451 47.969 1.00 40.93 
88.961 9.072 47.249 1.00 39,24 
88.525 8.296 46.083 1.00 40.08 
89.697 7.500 45.479 1.00 41.68 
89.269 6.383 44.523 1.00 42.83 
88.914 6.656 43.200 1.00 43.76 
8B.487 5.638 42.322 1.00 45.59 
89.191 5.062 44.951 1.00 43.68 
88.763 4.035 44,086 1.00 46.67 
88.411 4.330 42.773 1.00 46.33 
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Figure 3 
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Figure 3 
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89.795 36.392 42.546 1.00 
86.264 36.2B0 43.587 1.00 
85.566 35.459 44.173 1.00 
85.920 37.556 43.483 1.00 
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82.855 41.163 43.191 1.00 
82.710 41.788 41.928 1.00 
81.919 41.453 44.206 1.00 
84.514 40.461 41.847 1.00 
83.734 41.343 41.132 1.00 
81.668 42.695 41.649 1.00 
80.882 42.355 43.928 1.00 

80.769 42.962 42.656 1.00 
83.429 37.358 43.558 1.00 
82.343 37.912 43.689 1.00 
83.551 36.191 42.935 1.00 
82.380 35.512 42.384 1.00 
82.748 34.612 41.176 1.00 
81.662 33.565 40.937 1.00 
82.867 35.466 39.920. 1.00 
81.524 35.967 39.406 1.00 
81.728 34.649 43.460 1.00 
80.499 .34.535 43.533 1.00 
82.568 j34. 032 44.286 1.00 
82.094; ;33;191 45.370 1.00 
83*274/. 32.547 46.094 1.00 
83,390 ; .31.071 45.814 1.00 
83.337 .30.765 44.333 1.00 
84.234 -31.139 43..576 1.00 
82 .270 . .30 . 092 43 . 907 1 . 00 
81.298: 34.044 46.338 1.00 
80^:317 , 33.588 46.930 1.00 
81.724 35.290 46.501 1.00 
81.011 36.178 47.393 1.00 
81.642 37.591 47.409 1.00 
80.761 38.552 48.184 1.00 
83.018 37.527 48.069 1.00 
79.562 36.233 46.912 1.00 
78.674 36.623 47.660 1.00 
79.325 35.821 45.669 1.00 
77.970 35.812 45.140 1.00 
77.965 35.796 43.617 1.00 
77.984 37.158 42.984 1.00 
77.147 37.449 41.925 1.00 
77.155 38.690 41.325 1.00 
78.843 38.149 43.435 1.00 
78.863 39.407 42.839 1.00 
78.013 39.668 41.784 1.00 
77.996 40.908 41.177 1.00 
77.194 34.602 45.631 1.00 
76.129 34.730 46.241 1.00 
77.722 33.418 45.360 1.00 
77.049 32.186 45.753 1.00 
77.837 31.000 45.223 1.00 
78.061 31.089 43.742 1.00 
77.110 31.109 42.953 1.00 
79.325 31.162 43.346 1.00 
76.874 32.0B7 47.252 1.00 
75.946 31.434 47.742 1.00 
77.767 32.749 47.974 1.00 
77.701 32.755 49.422 1.00 
79.055 33.189 50.004 1.00 
79.982 32,011 50.281 1.00 
79.679 31.152 51.117 1.00 
81.114 31.961 49.575 1.00 
76.576 33.680 49.909 1.00 
75.986 33.461 50.968 1.00 
76.259 34,696 49.113 1.00 
75.210 35;628 49.474 1.00 
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Figure 3 
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74.063 35.982 12.097 1.00 58.62 B 

75.003 34.897 10.412 1.00 62.01 B 

71.687 31.298 11.794 1.00 50.34 B 

71.05B 31.002 10.786 1.00 51.45 B 

72.695 30.566 12.257 1.00 49.68 B 

73.177 29.379 11.575 1.00 48.81 B 
74.377 28.806 12.334 1.00 47.41 B 
75.729 29.065 11.686 1.00 47.15 B 
76.109 30.543 11.631 1.00 47.21 B 
77.500 30.764 10.994 1.00 47.47 B 
77.587 30.428 9.528 1.00 49.50 B 
72.124 28.293 11.359 1.00 48.74 B 
72.071 27.686 10.298 1.00 49.45 B 
71.278 28.046 12.352 1.00 49.27 B 
70.262 27.013 12.214 1.00 51.08 B 
69.589 26.698 13.547 1.00 50.98 B 
70.648 26.525 14.616 1.00 52.15 B 
68.606 27.804 13.905 1.00 52.85 B 

69.178 27.424 11.237 1.00 53.35 B 
68.189 26.711 11.059 1.00 53.14 B 
69.338 28.593 10.628 1.00 55.02 B 
68.365 29.046 9.652 1.00 56.65 B 
68.080 30.544 9.819 1.00 57.72 B 
67.178 30.862 11.006 1.00 59.78 B 
66.557 32.250 10.876 1.00 61.55 B 
65.551 32.542 11.998 1.00 61.89 B 
64.464 31.511 12.100 1.00 60.53 B 
68.962 28.740 8.288 1.00 56.74 B 
68.267 28.283 7.389 1.00 55.85 . B 
70.266 28.969 8.156 1.00 58.18 B 

70.972 28.686 6.915 1.00 60.53 B 
72.413 29.133 7.010 1.00 61.19 B 
72.578 30.587 7.279 i.00 63.62 B 
74.023 30.952 7.464 1.00 66.37 B 
74.297 32.158 7.666 1.00 67.56 B 
74.882 30.033 7.411 1.00 67.44 B 

70.973 27.188 6.695 1.00 61.24 B 
71.083 26.702 5.570 1.00 61.63 B 
70.871 26.467 7.801 1.00 61.79 B 
70.860 25.021 7.790 1.00 62.88 B 
71.377 24.485 9.124 1.00 61.61 B 
72.865 24.365 9.190 1.00 60.62 B 
73.655 23.298 9.440 1.00 59.65 B 
73.716 25.434 8.994 1.00 60.52 B 
74.966 25.026 9.121 1.00 59.57 B 
74.956 23.733 9.391 1.00 60.02 B 
69.474 24.463 7.518 1.00 64.15 B 
69.340 23.496 6.784 1.00 65.83 B 
68.442 25.057 8.111 1.00 64.80 B 
67.087 24.573 7.885 1.00 65.37 B 
66.064 25.438 8.622 1.00 64.01 B 
65.800 25.008 10.053 1.00 64.90 B 
64.853 25.964 10.760 1.00 65.23 B 
63.764 25.571 11.188 1.00 65.86 B 
65.269 27.233 10,887 1.00 65.65 B 
66.783 24.572 6.393 1.00 66.52 B 
66.078 23.691 5.899 1.00 66.18 B 
67.328 25.544 5.669 1.00 68.69 B 
67.097 25.634 4.233 1.00 71.12 B 
67.469 27.041 3.732 1.00 73.22 B 
66.768 27.451 2.433 1.00 77.82 B 
67.355 26.782 1.203 1.00 80.76 B 
66.668 26.745 0.150 1.00 B2.61 B 
68.511 26.303 1.285 1.00 81.73 B 
67.873 24.554 3.460 1.00 71.89 B 
67.267 23.690 2.822 1.00 72.66 B 
69.205 24.598 3.539 1.00 72.93 B 
70.105 23.657 2.841 1.00 73.01 B 
71.498 24.306 2.691 1.00 73.07 B 
72.358 23.597 1.812 1.00 72.63 B 
70.247 22.311 3.571 1.00 72.77 B 
71.195 21.557 3.336 1.00 71.98 B 
69.307 22.018 4.456 1.00 73.82 B 
69.337 20.777 5.209 1.00 74.60 B 
68.404 20.871 6.428 1.00 78.04 B 



0 




0 




c 




0 




N 




C 




C 




C 




C 




C 




N 




C 




0 




N 




c 




c 




c 




c 




c 




0 




N 




C 




C 




C 




c 




c 




N 




C 




0 




N 
C 


a-' 


C 


■ : : l*> 


C 


\„ - 


C 




0 


. : o: 


0 




C 




0 




N 




C 




C 




c 




c 




N 




C 




N 




C 




0 




N 




C 




C 




C 




C 




0 




N 




C 




0 




N 




C 




c 




c 




c 




0 




0 




c 




0 




N 




c 




c 




0 




c 




0 




N 




C 




C 





SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



ATOM 


5547 


CG 


MET 


B 255 


ATOM 


5548 


SD 


MET 


B 255 


ATOM 


5549 


CE 


MET 


B 255 


ATOM 


5550 


C 


MET 


B 255 


ATOM 


5551 


0 


MET 


B 255 


ATOM 


5552 


N 


ASP 


B 256 


ATOM 


5553 


CA 


ASP 


B 256 


ATOM 


5554 


CB 


ASP 


B 256 


ATOM 


5555 


CG 


ASP 


B 256 


ATOM 


5556 


OD1 


ASP 


B 256 


ATOM 


5557 


OD2 ASP B 256 


ATOM 


5558 


C 


ASP B 256 


ATOM 


5559 


0 


ASP 


B 256 


ATOM 


5560 


N 


MET 


B 257 


ATOM 


5561 


CA 


MET 


B 257 ' 


ATOM 


5562 


CB 


MET 


B 257 


ATOM 


5563 


CG 


MET 


B 257 


ATOM 


5564 


SD 


MET 


B 257 


ATOM 


5565 


CE 


MET 


B 257 


ATOM 


5566 


C 


MET 


B 257 


ATOM 


5567 


0 


MET 


B 257 


ATOM 


5568 


N 


ASN 


B 258. 


ATOM 


5569 


CA 


ASH 


B 258 


ATOM 


5570 


CB 


ASN 


B 258 


ATOM 


5571 


CG 


ASM 


B 258 


ATOM 


5572 


OD1 


ASN 


B 258 


ATOM 


5573 


ND2 


ASN 


B 258 


ATOM 


5574 


C 


ASN 


B 258 


ATOM 


5575 


0 


ASN 


B 258 


ATOM 


5576 


N 


ASN 


B 259 


ATOM 


5577 


CA 


ASN 


B 259 


ATOM 


5578 


CB 


ASN 


B 259 


ATOM 


5579 


CG 


ASN 


B 259 


ATOM 


5580 


OD1 


ASN 


B 259 


ATOM 


5581 


ND2 


ASN 


B 259 


ATOM 


5562 


C 


ASN 


B 259 


ATOM 


5583 


0 


ASN B 259. 


ATOM 


5584 


N 


PRO 


B 260 


ATOM 


5585 


CD 


PRO B 260 


ATOM 


5586 


CA 


PRO B 260 


ATOM 


5587 


CB 


PRO 


B 260 


ATOM 


5588 


CG 


PRO B 260 


ATOM 


5589 


C 


PRO B 260 


ATOM 


5590 


0. 


PRO 


B 260 


ATOM 


5591 


N 


GLN 


B 261 


ATOM 


5592 


CA 


GLN 


B 261 


ATOM 


5593 


CB 


GLN 


B 261 


ATOM 


5594 


CG 


GLN 


B 261 


ATOM 


5595 


CD 


GLN 


B 261 


ATOM 


5596 


OE1 


GLN 


B 261 


ATOM 


5597 


NE2 


GLN 


B 261 


ATOM 


5598 


C 


GLN B 261 


ATOM 


5599 


0 


GLN 


B 261 


ATOM 


5600 


N 


ASP 


B 262 


ATOM 


5601 


CA 


ASP 


B 262 


ATOM 


5602 


CB 


ASP 


B 262 


ATOM 


5603 


CG 


ASP 


B 262 


ATOM 


5604 


OD1 


ASP 


B 262 


ATOM 


5605 


OD2 


ASP 


B 262 


ATOM 


5606 


C 


ASP 


B 262 


ATOM 


5607 


0 


ASP 


B 262 


ATOM 


5608 


N 


PHE 


B 263 


ATOM 


5609 


CA 


PHE 


B 263 


ATOM 


5610 


CB 


PHE 


B 263 


ATOM 


5611 


CG 


PHE 


B 263 


ATOM 


5612 


CD1 


PHE 


B 263 


ATOM 


5613 


CD2 


PHE 


B 263 


ATOM 


5614 


CE1 


PHE 


B 263 


ATOM 


5615 


CE2 


PHE 


B 263 


ATOM 


5616 


CZ 


PHE 


B 263 


ATOM 


5617 


C 


PHE 


B 263 


ATOM 


5618 


0 


PHE 


B 263 


ATOM 


5619 


N 


ILE 


B 264 


ATOM 


5620 


CA 


ILE 


B 264 


ATOM 


5621 


CB 


ILE B 264 



271/514 
Figure 3 



67.861 19.542 

66.224 19.105 

65.065 20.025 

68.926 19.630 

67.889 19.688 

69,764 18.596 

69.542 17.392 

70.670 17.230 

70.518 15.974 

69.382 15.449 

71.540 15.521 

69.481 16.181 

70.473 15.810 

68.286 15.601 

68.058 14.417 

66.563 14.052 

65.605 15.153 

65.079 15.039 

64.097 16.620 

68.838 13.331 

69.107 13.477 

69.210 12.257 

69.961 11.176 

69.333 10.830 

67.811 10.649 

67.313 9.696 

67.068 11.565 

71.418 ' 11.593 

72.234 10.784. 

71.733 .12.852 

73.081 .;13, 365 

73.188 13.964 

73.234 ;12.914 

74.224 12.186 

72.161- 12.814 

73.537 14.405 

74.125 ..,15.425 

73.293 14.156 

72.826, .12.917 

73.720 ^15.135 

73.254 14.506 

73.412 13.045 

75.230 15.329 

75.960 14.413 

75.690 16.520 

77.111 16.809 

77.464 17.707 

77.125 17.154 

78.156 17.583 

79.324 17.184 

77.739 18.410 

77.634 17.455 

78.845 17.600 

76.741 17.877 

77.160 18.483 

77.581 19.947 

76.466 20.835 

76.832 21.824 

75.263 20.567 

76.132 18.389 

75.011 17.931 

76.522 18.833 

75.639 18.759 

76.202 19.536 

75.559 19.188 

75.96B 18.064 

74.533 19.964 

75.361 17.717 

73.920 19.620 

74.336 18.495 

74.265 19.300 

73.259 18.666 

74.215 20.485 

72.927 21.100 

73.118 22.433 



6.969 1.00 81.63 

6.257 1.00 87.34 

7.397 1.00 85.70 

4.308 1.00 73.64 

3.629 1.00 72.83 

4.299 1.00 71.90 

3.506 1.00 69.92 

2.469 1.00 70.72 

1.614 1.00 70.58 

1.492 1.00 70.64 

I. 048 1.00 71.56 
4.434 1.00 68.11 
5.067 1.00 66.52 
4.530 1.00 65.51 
5.346 1.00 65.48 
5.380 1.00 67.23 
5.885 1.00 68.84 
7.658 1.00 71.37 
7.814 1.00 71.82 
4.609 1.00 64.97 
3.414 1.00 66.71 
5.299 1.00 65.17 
4.654 1.00 64.71 
3.291 1.00 66.50 
3,358 1.00 68.28 
3.973 1.00 69.64 
2.718 1.00 66.98 
4.425 1.00 63.11 
3.979 1.00 63.10 
4.725 1.00 61.91 
4.524 1.00 60.39 
3.130 1.00 61.49 
2.064 1.00 62.22 
1.941 1.00 64.39 
1.281 1.00 62.03 
5.533 1.00 59.45 
5.161 1.00 59.36 
6.825 1.00 57.71 
7.472 1.00 57.55 
7.821 1.00 56.35 
9.125 1.00 56.68 
8.858 1.00 57.06 
7.775 1.00 54.67 
7.402 1.00 53.80 
8.152 1.00 53.48 
8.143 1.00 53.59 
6.966 1.00 55.33 
5.603 1.00 59.00 
4.573 1.00 61.42 
4.658 1.00 62.22 
3.603 1.00 62.78 
9.420 1.00 52.56 
9.578 1.00 53.23 

10.313 1.00 50.51 

II. 586 1.00 48.20 
11.400 1.00 48.05 
10.848 1.00 47.66 
10.160 1.00 46.01 
11.10B 1.00 45.97 
12.702 1.00 47.06 
12.496 1.00 46.54 
13.889 1.00 45.53 
15.032 1.00 44.41 
16.207 1.00 44.42 
17.509 1.00 44.28 
18.207 1.00 45.54 
18.031 1.00 44.61 
19.410 1.00 46.26 
19.237 1.00 44.30 
19.926 1.00 43.94 
14.727 1.00 44.75 
15.027 1.00 44.07 
14.133 1.00 44.93 
13.820' 1.00 45.85 
13.098 1.00 44.47 



PCT/GB02/04872 



B 


C 


B 


S 


B 


C 


B 


C 


B 


O 


B 


N 


B 


C 


B 


C 


B 


C 


B 


O 


B 


0 


B 


C 


B 


0 


B 


N 


B 


C 


B 


C 


B 


C 


B 


s 


B 


c 


B 


c 


B 


0 


B 


N 


B 


c 


B 


c 


B 


c 


B 


0 


B 


N 


B 


c 


B 


0 


B 


N 


B 


c 


B 


c 


B 


c 


B 


0 


B 


N 


B 


c 


B 


O 


B 


N 


B 


c 


B 


c 


B 


c 


B 


c 


B 


c 


B 


O 


B 


N 


B 


c 


B 


c 


B 


c 


B 


c 


B 


O 


B 


N 


B 


c 


B 


O 


B 


N 


B 


c 


B 


c 


B 


c 


B ' 


O 


B 


O 


B 


C 


B 


O 


B 


N 


B 


c 


B 


c 


B 


c 


B 


c 


B 


c 


B 


C 


B 


C 


B 


c 


B 


C 


B 


O 


B 


N 


B 


c 


B 


c 



SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



272/514 



PCT/GB02/04872 



Figure 3 



ATOM 


5622 


CG2 


I LB 


B 


264 


71.773 


23.017 


12.730 


1.00 


45.20 


B 


C 


ATOM 


5623 


CGI 


ILE 


B 


264 


73.862 


23.400 


14.019 


1.00 


43.05 


B 


C 


ATOM 


5624 


CD1 


ILE 


B 


264 


74.606 


24.470 


13.283 


1.00 


40.57 


' B 


C 


ATOM 


5625 


c 


ILE 


B 


264 


72.030 


20.193 


12.987 


1:00 


47.22 


B 


c 


ATOM 


5626 


0 


ILE 


B 


264 


70.926 


19.854 


13.418 


1.00 


47.36 


B 


0 


ATOM 


5627 


N 


ASP 


B 


265 


72.496 


19.807 


11.800 


1.00 


48.11 


B 


N 


ATOM 


5628 


CA 


ASP 


B 


265 


71.708 


18.930 


10.932 


1.00 


48.71 


B 


C 


ATOM 


5629 


CB 


ASP 


B 


265 


72.571 


18.311 


9.815 


1.00 


51.99 


B 


C 


ATOM 


5630 


CG 


ASP 


B 


265 


72.973 


19.310 


8.734 


1.00 


55.54 


B 


C 


ATOM 


5631 


OD1 


ASP 


B 


265 


72.097 


20.055 


8.235 


1.00 


58.02 


B 


0 


ATOM 


5632 


OD2 


ASP 


B 


265 


74.173 


19.333 


B.366 


1.00 


57.00 


B 


0 


ATOM 


5633 


C 


ASP 


B 


265 


71.099 


17.804 


11.766 


1.00 


47.73 


B 


C 


ATOM 


5634 


0 


ASP 


B 


265 


69.883 


17.654 


11.812 


1.00 


48.27 


B 


0 


ATOM 


5635 


N 


CYS 


B 


266 


71.941 


17.025 


12.442 


1.00 


47.92 


B 


N 


ATOM 


5636 


CA 


CYS 


B 


266 


71.447 


15.907 


13.243 


1.00 


47.92 


B 


C 


ATOM 


5637 


CB 


CYS 


B 


266 


72.589 


15.224 


13.986 


1.00 


46.83 


B 


C 


ATOM 


5638 


SG 


CYS 


B 


266 


74.050 


14.907 


12.993 


1.00 


45.03 


B 


s 


ATOM 


5639 


C 


CYS 


B 


266' 


70.400 


16.352 


14.251 


1.00 


48.66 


B 


C 


ATOM 


5640 


O 


CYS 


B 


266 


69.517 


15.581 


14.626 


1.00 


49.40 


B 


0 


ATOM 


5641 


N 


PHE 


B 


267 


70.510 


17.592 


14.708 


1.00 


49.73 


B 


N 


ATOM 


5642 


CA 


PHE 


B 


267 


69.547 


18.101 


15.662 


1.00 


51.00 


B 


C 


ATOM 


5643 


CB 


PHE 


B 


267 


70.099 


19.314 


16.428 


1.00 


50.24 


B 


C 


ATOM 


5644 


CG 


PHE 


B 


267 


69.390 


19.581 


17.729 


1.00 


49.19 


B 


C 


ATOM 


5645 


CD1 


PHE 


B 


267 


69.842 


19.016 


18.914 


1.00 


48.78 


B 


C 


ATOM 


5646 


CD2 


PHE 


B 


267 


68.223 


20.329 


17.753 


1.00 


48.36 


B 


C 


ATOM 


5647 


CE1 


PHE 


B 


267 


69.145 


19.199 


20.088 


1.00 47.51 


B 


C 


ATOM 


5648 


CE2 


PHE 


B 


267 


67.523 


20.509 


18.928 


1.00 


48.39 


B 


C 


ATOM 


5649 


cz 


PHE 


B 


267 


67.981 


19.940 


20.089 


1;'00 ; 


47.78 


B 


C 


ATOM 


5650 


c 


PHE 


B 


267 


68.311 


18.515 


14.876 


i-oo 


52.53 


B 


C 


ATOM 


5651 


O 


PHE 


B 


267 


67.189 


18.466 


15.387 


,1;00 


53.33 


B 


0 


ATOM 


5652 


N 


LEU 


B 


268 


68.508 


18.928 


13.630* 


'i.oo' 


53.12 


B 


N 


ATOM 


5653 


CA 


LEU 


B 


268 


67.369 


19.333 


12.822 


i;oo- 


53.68 


B 


c 


ATOM 


5654 


CB 


LEU 


B 


268 


67.840 


20.068 


11.562 


1V00'53.59 


B 


c 


ATOM 


5655 


CG 


LEU 


B 


268 


67.761 


21.599 


11.676 


i;00' 54.06 


B 


C 


ATOM 


5656 


CD1 


LEU 


B 


268 


68.381 


22.066 


12.972 


l':00 '55 .12 


B 


C 


ATOM 


5657 


CD2 


LEU 


B 


268 


68.459 


22.245 


10.487 ' 


1*00*55.09 


B 


c 


ATOM 


5658 


C 


LEU 


B 


268 


66.463 


18.148 


12.456 


l;00 


55.44 


B 


C 


ATOM 


5659 


O 


LEU 


B 


268 


65.236 


18.282 


12.502* 


i:oo 


55.20 


B 


0 


ATOM 


5660 


N 


MET 


B 


269 


67.069 


16.999 


12.130 


1,00: 


56.98 


B 


N 


ATOM 


5661 


CA 


MET 


B 


269 


66.320 


15.794 


11.760 


1.00 


59.34 


. B 


C 


ATOM 


5662 


CB 


MET 


B 


269 


67.140 


14.876 


10.823 


1.00 


60.05 


B 


C 


ATOM 


5663 


CG 


MET 


B 


269. 


68.458 


14.316 


11.376 


1.00 


62.72 


B 


C 


ATOM 


5664 


SD 


MET 


B 


269 


69.254 


13.058 


10.261 


1.00 


66.06 


B 


S 


ATOM 


5665 


CE 


MET 


B 


269 


69.481 


13.994 


8.727 


1.00 


64.43 


B 


C 


ATOM 


5666 


C 


MET 


B 


269 


65.838 


15.005 


12.967 


1.00 


59.39 


B 


c 


ATOM 


5667 


0 


MET 


B 


269 


64.964 


14.132 


12.870 


1.00 


58.85 


B 


0 


ATOM 


5668 


N 


LYS 


B 


270 


66.411 


15.308 


14.116 


1.00 


60.86 


B 


N 


ATOM 


5669 


CA 


LYS 


B- 


270 


65.984 


14.625 


15.313 


1.00 


63.21 


B 


C 


ATOM 


5670 


CB 


LYS 


B 


270 


67.010 


14.811 


16.425 


1.00 


62.61 


B 


C 


ATOM 


5671 


CG 


LYS 


B 


270 


66.573 


14.230 


17.750 


1.00 


62.78 


B 


C 


ATOM 


5672 


CD 


LYS 


B 


270 


66.429 


12.716 


17.711 


1.00 


62 . 60 


B 


C 


ATOM 


5673 


CE 


LYS 


B 


270 


65.886 


12.219 


19.052 


1.00 


62.44 


B 


C 


ATOM 


5674 


NZ 


LYS 


B 


270 


65.984 


10.741 


19.250 


1.00 


61.09 


B 


N 


ATOM 


5675 


C 


LYS 


B 


270 


64.627 


15.224 


15.710 


1.00 


66.08 


B 


C 


ATOM 


5676 


0 


LYS 


B 


270 


63.793 


14.540 


16.309 


1.00 


66.10 


B 


0 


ATOM 


5677 


N 


MET 


B 


271 


. 64.402 


16.492 


15.357 


1.00 


68.08 


B 


N 


ATOM 


5678 


CA 


MET 


B 


271 


63.136 


17.168 


15.661 


1.00 


69.76 


B 


C 


ATOM 


5679 


CB 


MET 


B 


271 


63.212 


18.665 


15.343 


1.00 


70.16 


B 


C 


ATOM 


5680 


CG 


MET 


B 


271 


64.148 


19.456 


16.235 


1.00 


71.08 


B 


C 


ATOM 


5681 


SD 


MET 


B 


271 


63.955 


21.264 


16.059 


1.00 


71.85 


B 


s 


ATOM 


5682 


CE 


MET 


B 


271 


64.973 


21.559 


14.616 


1.00 


70.30 


B 


C 


ATOM 


5683 


C 


MET 


B 


271 


62.026 


16.566 


14.819 


1.00 


71.34 


B 


C 


ATOM 


5684 


0 


MET 


B 


271 


60.918 


16.316 


15.302 


1.00 


70.75 


B 


0 


ATOM 


5685 


N 


GLU 


B 


272 


62.339 


16.350 


13.544 


1.00 


73.69 


B 


N 


ATOM 


5686 


CA 


GLU 


B 


272 


61.392 


15.782 


12.598 


1.00 


76.77 


B 


C 


ATOM 


5687 


CB 


GLU 


B 


272 


61.943 


15.877 


11.159 


1.00 


77.75 


B 


C 


ATOM 


5688 


CG 


GLU 


B 


272 


60.904 


15.649 


10.033 


1.00 


80.19 


B 


C 


ATOM 


5689 


CD 


GLU 


B 


272 


60.265 


16.941 


9.494 


1.00 


82.08 


B 


C 


ATOM 


5690 


OE1 


GLU 


B 


272 


59.868 


17.818 


10.304 


1.00 


83.11 


B 


0 


ATOM 


5691 


OE2 


GLU 


B 


272 


60.148 


17.074 


8.248 


1.00 


82.13 


B 


0 


ATOM 


5692 


C 


GLU 


B 


272 


61.085 


14.326 


12.982 


1.00 


78.02 


B 


C 


ATOM 


5693 


O 


GLU 


B 


272 


60.037 


13.801 


12.615 


1.00 


78.91 


B 


0 


ATOM 


5694 


N 


LYS 


B 


273- 


61.984 


13.665 


13.713 


1.00 


79.21 


B 


N 


ATOM 


5695 


CA 


LYS 


B 


273 


61.698 


12.290 


14.134 


1.00 


81.62 . 


B 


C 


ATOM 


5696 


CB 


LYS 


B 


273 


62.971 


11.560 


14.611 


1.00 


81.40 


B 


C 



SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



ATOM 


5697 


CG 


LYS 


B 273 


ATOM 


5698 


CD 


LYS 


B 273 


ATOM 


5699 


CB 


LYS 


B 273 


ATOM 


5700 


HZ 


LYS 


B 273 


ATOM 


5701 


C 


LYS 


B 273 


ATOM 


5702 


0 


LYS 


B 273 


ATOM 


5703 


N 


GLU 


B 274 


ATOM 


5704 


CA 


GLU 


B 274 


ATOM 


5705 


CB 


GLU 


B 274 


ATOM 


5706 


CG 


GLU 


B 274 


ATOM 


5707 


CD 


GLU 


B 274 


ATOM 


5708 


OE1 


GLU 


B 274 


ATOM 


5709 


OE2 


GLU 


B 274 


ATOM 


5710 


C 


GLU 


B 274 


ATOM 


5711 


0 


GLU 


B 274 


ATOM 


5712 


N 


LYS 


B 275 


ATOM 


5713 


CA 


LYS 


B 275 


ATOM 


5714 


CB 


LYS 


B 275 


ATOM 


5715 


CG 


LYS 


B 275 


ATOM 


5716 


CD 


LYS 


B 275 


ATOM 


5717 


CE 


LYS 


B 275 


ATOM 


5718 


NZ 


LYS 


B 275 


ATOM 


5719 


C 


LYS 


B 275 


ATOM 


5720 


0 


LYS 


B 275 


ATOM 


5721 


N 


HIS 


B 276 


ATOM 


5722 


CA 


HIS 


B 276 


ATOM 


5723 


CB 


HIS 


B 276 


ATOM 


5724 


CG 


HIS 


B 276 


ATOM 


5725 


CD2 


HIS 


B 276 


ATOM 


5726 


ND1 


HIS 


B 276 


ATOM 


5727 


CGI 


HIS 


B 276 


ATOM 


5728 


NE2 


HIS 


B 276 


ATOM 


5729 


C 


HIS 


B 276 


ATOM 


5730 


0 


HIS B 276 


ATOM 


5731 


N 


ASN 


B 277 


ATOM 


5732 


CA 


ASN 


B 277 


ATOM 


5733 


CB 


ASN 


B 277 


ATOM 


5734 


CG 


ASN 


B 277 


ATOM 


5735 


OD1 


ASN 


B 277 


ATOM 


5736 


ND2 


ASN 


B 277 


ATOM 


5737 


C 


ASN 


B 277 


ATOM 


5738 


0 


ASN 


B 277 


ATOM 


5739 


N 


GLN 


B 278 


ATOM 


5740 


CA 


GLN 


B 278 


ATOM 


5741 


CB 


GLN 


B 278 


ATOM 


5742 


CG 


GLN 


B 278 


ATOM 


5743 


CD 


GLN 


B 278 


ATOM 


5744 


OE1 


GLN 


B 278 


ATOM 


5745 


NE2 


GLN 


B 278 


ATOM 


5746 


C 


GLN 


B 278 


ATOM 


5747 


0 


GLN 


B 278 


ATOM 


5748 


N 


PRO 


B 279 


ATOM 


5749 


CD 


PRO B 279 


ATOM 


5750 


CA 


PRO B 279 


ATOM 


5751 


CB 


PRO 


B 279 


ATOM 


5752 


CG 


PRO 


B 279 


ATOM 


5753 


C 


PRO 


B 279 


ATOM 


5754 


0 


PRO 


B 279 


ATOM 


5755 


N 


SER 


B 280 


ATOM 


5756 


CA 


SER 


B 280 


ATOM 


5757 


CB 


SER 


B 280 


ATOM 


5758 


OG 


SER B 280 


ATOM 


5759 


C 


SER B 280 


ATOM 


5760 


0 


SER B 280 


ATOM 


5761 


N 


GLU 


B 281 


ATOM 


5762 


CA 


GLU 


B 281 


ATOM 


5763 


CB 


GLU 


B 281 


ATOM 


5764 


CG 


GLU 


B 281. 


ATOM 


5765 


CD 


GLU 


B 281 


ATOM 


5766 


OE1 


GLU 


B 281 


ATOM 


5767 


OE2 


GLU 


B 281 


ATOM 


5768 


C 


GLU 


B 281 


ATOM 


5769 


0 


GLU 


B 281 


ATOM 


5770 


N 


PHE 


B 282 


ATOM 


5771 


CA 


PHE 


B 282 



273/514 



.Figure 3 
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Figure 3 
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Figure 3 



112.648 23.527 12.900 1.00 

113.407 22,309 13.105 1.00 

112.534 21.110 12.662 1.00 

111.437 20.863 13.685 1.00 

111.886 21.418 11.285 1.00 

114.785 22.282 12.408 1.00 

114.851 22.385 11.147 1.00 

115.793 22.143 13.156 1.00 

84.057 18.537 29.656 1.00 

83.673 18.710 31.784 1.00 

86.120 18.897 29.977 1.00 

84.233 19.007 27.593 1.00 

81.852 18.831 29.321 1.00 
82.551 18.153 32.325 1.00 
82.881 17.683 33.661 1.00 
84.220 17.838 33.864 1.00 
84.819 18.408 32.649 1.00 
86.811 18.802 31.109 1.00 
88.217 18.925 30.807 1.00 
88.338 19.043 29.479 1.00 
87.022 19.071 28.885 1.00 
85.396 19.244 26.950 1.00 
85.112 19.472 25.546 1.00 
83.792 19.498 25.365 1.00 
83.140 19.239 26.660 1.00 
81.143 19.049 28.212 1.00 
79.712 19.039 28.550 1.00 
79.593 18.619 29.821 1.00 
80.942 18.502 30.403 1.00 
81.225 18.118 31.671 1.00 
86.158 18.572 32.415 1.00 
86.757 19.230 27.590 1.00 
81.792 19.278 26.894 1.00 
85.019 17.724 35.112 1.00 

81.853 16.873 34.514 1.00 
81.126 17.508 35.714 1.00 
80.631 16.370 36.660 1.00 
79.540 15.750 36.172 1.00 
81.186 16.038 37.638 1.00 
89.294 18.594 31.848 1.00 
89.534 19.429 28.639 1.00 
90.648 18.406 28.616 1.00 
86.207 19.898 24.590 1.00 
82.970 19.411 24.090 1.00 
82.224 18.080 23.860 1.00 
78.623 19.215 27.527 1.00 
78.327 18.448 30.673 1.00 

78.234 17.047 31.318 1.00 
76.826 16.714 31.827 1.00 
76.106 15.921 31.309 1.00 
76.505 17.425 32.917 1.00 
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Figure 4 



Table 7 



4.223 83.036 37.035 1.00 15.00 

3.197 83.359 38.023 1.00 15.00 

3.067 84.892 38.208 1.00 15.00 

2.028 85.215 39.273 1.00 15.00 

4.414 B5.493 38.614 1.00 15.00 

4.380 86.998 38.807 1.00 15.00 

1.B33 82.790 37.623 1.00 15.00 

0.973 82.562 38.475 1.00 55.09 

81.352 35.442 46.381 1.00 15.00 

' 80.598 36.164 47.404 1.00 15.00 

81.045 37.646 47.501 1.00 15.00 

80.394 38.317 48.701 1.00 15.00 

82.565 37.732 47.651 1.00 15.00 

83.091 39.143 47.828 1.00 15.00 

79.094 36.111 47.112 1.00 15.00 

78.271 36.250 48.021 1.00 48.67 

1.623 82.622 36.321 1.00 15.00 

0.370 82.084 35.808 l.OO 15.00 

0.244 82.381 34.313 1.00 15.00 

0.287 84.150 33.907 1.00 15.00 

0.283 80.580 36.072 1.00 15.00 

-0.809 80.032 36.242 1.00 44.98 

78.749 35.959 45.837 1.00 15.00 

77.355 35.886 45.407 1.00 15.00 

77.253 36.134 43.900 1.00 15.00 

77.846 37.757 43.356 1.00 15.00 

76.743 34.530 45.750 1.00 15.00 

75.547 34.433 46.035 1.00 51.99 

-3.379 79.751 39.065 1.00 15.00 

-4.329 79.529 37.983 1,00 15.00 

-4.969 78.135 38.108 1.00 15.00 

-5.608 77.953 39.368 1.00 15.00 

-5.412 80.607 37.811 1.00 15. OQ 

-5.844 80.872 36.688 1.00 50.32 

72.877 35.045 48.647 1.00 15.00 

71.967 35.210 47.518 1.00 15.00 

70.808 34.203 47.612 1.00 15.00 

70.117 34.298 48.855 1.00 15.00 

71.429 36.633 47.314 1.00 15.00 

71.171 37.038 46.181 1.00 51.16 

-5.890 81.214 38.916 1.00 15.00 

-5.680 80.84B 40.329 1.00 15.00 

-6.925 82.249 38.807 1.00 15.00 

-7.492 82.306 40.226 1.00 15.00 

-6.300 82.007 41.063 1.00 15.00 

-6.425 83.625 38.355 1.00 15.00 

-7.203 84.577 38.280 1.00 51.31 

71.206 37.391 48.407 1.00 15.00 

71.113 36.974 49.819 1.00 15.00 

70.693 38.757 48.244 1.00 15.00 

70.095 39.056 49.618 1.00 15.00 

70.984 38.290 50.537 1.00 15.00 

71.718 .39.823 47.838 1.00 15.00 

71.450 41.019 47.965 1.00 46.61 

-5.121 83.751 38.129 1.00 15 \ 00 

-4.559 85.019 37.679 1.00 15.00 

-3.083 85.168 38.106 1.00 15.00 

-2.379 86.251 37.290 1.00 15.00 

-3.020 85.485 39.601 1.00 15.00 

-1.630 85.776 40.115- 1.00 15.00 

-4.723 85.171 36.170 1.00 15.00 

-4.843 86.285 35.658 1.00 58.80 

72.908 39,398 47.423 1.00 15.00 
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Figure 4 

73.943 40.337 46.995 1.00 

75.368 39.820 47.340 1.00 

76.438 40.579 46.551 1.00 

75.615 39.962 48.845 1.00 

77.029 39.635 49.276 1.00 

73.819 40.639 45.503 1.00 

74.114 41.752 45.061 1.00 

-4.728 84.041 35.469 1.00 

-4.891 B4.018 34.016 1.00 

-4.744 82.580 33.453 1.00 

-4.742 82.605 31.92$ 1.00 

-3.447 81.941 33.965 1.00 

-3.270 80.498 33.543 1.00 

-6.269 84.582 33.638 T.00 

-6.446 85.153 32.554 1.00 

73.381 39.644 44.737 1.00 

73.200 39.791 43.295 1.00 

72.752 38.455 42.652 1.00 

72.736 38.578 41.134 1.00 

73.699 37.325 43.069 1.00 

73.305 35.962 42.537 1.00 

72.153 40.874 43.004 1.00 

72.218 41.552 41.975 1.00 
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Figure 5 
Table 8 



61.335 49.245 1.00 61.00 N 

61.919 4B.416 1.00 61.43 C 

61.966 46.979 1.00 61.58 C 

61.199 47.016 1.00 63.73 C 

61.064 48.459 1.00 62.97 C 

63.307 48.907 1.00 58.16 C 

64.150 48.681 1.00 61.01 O 

63.542 49,529 1.00 55.37 N . 

64.685 50.431 1.00 53.40 C 
64.414 51.056 1.00 52.92 C 
63.562 50.074 1.00 53.95 C 
62.755 49.377 1.00 56.47 C 
65.984 49.683 1.00 53.45 . C 
65.933 48.486 1.00 58.59 O 
67.100 50.368 1.00 53.10 N 
68.411 49.742 1.00 52;79 C 
69.477 50.736 1.00 54.93 C 
69.157 51.728 1.00 56.94 O 

70.742 50.474 1.00 54.97 N 
71.805 51.421 1.00 56.07 C 
73.090 50.767 1.00 55.27 C 
72.764 49.341 1.00 56.42 C 
71.274 49.241 1,00 56.05 C 
71,886 51.550 1.00 56.18 C 
71.707 50.543 1.00 55.49 O 
72.100 52.779 1.00 58.13 N 
72.390 53.080 1.00 59.19 C 
72.433 54.640 1.00 62.47 C 

72.686 55.359 1.00 67.08 O 
71.068 55.183 1.00 63.35 C 
73.729 52.439 1.00 55.16 C 
74.773 52.658 1.00 49.57 O 
73.688 51.647 1.00 52.53 N 
74.872 50.942 1.00 57.60 C 
74.258 49.792 1.00 58.33 C 
72.968 50.404 1.00 57.16 C 
72'.483 51.341 1.00 51.06 C 

75.743 51.773 1.00 62.91 C 
75.227 52.604 1.00 66.03 O 
77.048 51.521 1.00 66.79 N 
78.003 52.213 1.00 67.88 C 
79.408 52.135 1.00 65.49 C 
79.725 53.197 1.00 64.24 C 
81.209 53.112 1.00 63.99 C 
79.351 54.623 1.00 63.01 C 
77.983 51.620 1.00 72.00 C 
77.570 50.462 1.00 71.33 O 
78.432 52.407 1.00 74.53 N 
78.218 52.092 1.00 74.69 C 
79.290 52.938 1.00 76.28 - C 
79.481 54.139 1.00 76.86 C 
79.182 53.674 1.00 75.76 C 
78.338 50.611 1.00 73.79 C 
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Figure 5 . 
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Figure 5 

15.099 76.282 39.738 

15.997 76.867 37.780 
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Figure 5 

66.922 49.848 1.00 39.56 C 

67.503 -49.347 1.00 38.41 C 

68.635 50.230 1.00 41.14 C 

68.379 51.464 1.00 44.54 C 

69.446 52.298 1.00 45.74 C 

70.766 51.B89 1.00 46.60 C 

71.821 52.675 1.00 45.62 O 

71.042 50.674 1.00 47.77 C 

69.979 49.855 1.00 45.52 C 

65.666 49.108 1.00 40.64 C 

64.662 49.162 1.00 42.58 O 

65.701 48.459 1.00 42.28 N 

64.550 47.732 1.00 3B.32 C 

64.652 46.260 1.00 38.27 C 

65.696 45.789 1.00 31.02 O 

63.568 45.532 1.00 37.41 N 

63.528 44.082 1.00 41.74 C 

62.236 43.653 1.00 41.63 C 

61.366 44.B19 1.00 39.11 C 

62.281 46.017 1.00 36.48 C 

63.440 43.664 1.00 43.12 C 

63.605 42.479 1.00 46.43 O 

63.149 44.567 1.00 44.24 N 

63.064 44.199 1.00 44.62 C 

61.615 44.191 1.00 45.28 C 

61.532 43.669 1.00 45.09 C 

60.732 43.360 1.00 48.58 C 

63.862 45.168 1.00 45.62 C 

63.344 46.197 1.00 43.30 . 0 

65.127 44,853 1.00 45.92 N 

65.910 45.719 1.00 45.96 C 

67.086 46.342 1.00 46.47 • C 

68.039 . 45.368 1.00 44.03 C 

67.808 44.907 1.00 41.08 C 

68.704 44.074 1.00 40.21 C 

69.865 43.691 1.00 45.93 C 

70.130 44.159 1.00 47.81 C 

69.214 45.004 1.00 44.86 C 

66.370 45.087 1.00 46.81 C 

66.370 43.866 1.00 39.93 0 
66.741 45.976 1.00 47.98 N 
67.268 45.647 1.00 47.44 C 
66.679 46.578 1.00 47.18 C 
65.288 46.281 1.00 49.03 0 
67.314 46.298 1.00 48.72 C 
68.793 45.753 1.00 47.38 C 
69.363 46.632 1.00 45.50 0 
69.445 44.647 1.00 48.54 N 
70.893 44.773 1.00 50.54 C 
71.363 43.776 1.00 51.26 C 
72.677 44.156 1.00 50.62 C 
72.396 45.222 1.00 50.37 C 
73.351 42.942 1.00 53.87 C 
71.393 44.353 1.00 51.85 C 
70.828 43.459 1.00 52.61 0 
72.470 44.984 1.00 52.53 N 
72.975 44.760 1.00 52.74 C 
73.223 46.098 1.00 53.90 C 
71.927 46.732 1.00 51.78 C 
71.219 47.588 1.00 53.07 C 

70.004 48.147 1.00 52.00 C 
69.488 47.828 1.00 49.04 C 
68.302 48.375 1.00 49.91 0 
70.157 46.962 1.00 50.81 C 

71.371 46.415 1.00 53.21 C 
74.231 43.925 1.00 50.64 C 
75.202 44.293 1.00 48.54 0 
74.189 42.787 1.00 49.74 N 
75.373 41.956 1.00 53.60 C 

75.005 40.484 1.00 56.56 C 
75.041 40.071 1.00 58.76 C 
76.008 39.203 1.00 61.14 C 
76.056 38.859 1.00 62.77 C 
75.154 39.394 1.00 63.28 C 
74.195 40.280 1.00 62.51 C 
74.146 40.618 1.00 61.18 C 
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Figure 5 

1.951 75.922 42.228 1.00 54.68 ■ C 

0.972 75.501 41.589 1.00 50.49 O 

1.872 76.844 43.195 1.00 54.77 N 

0.630 77.105 43.906 1.00 53.82 C 

0.129 75.824 44.575 1.00 56.38 C 

0.739 75.363 45.552 1.00 55.79 O 

-0.942 75.230 44.025 1.00 55.08 N 

-1.569 74.012 44.576 1.00 55.18 C 

-3.089 74.161 44.563 1.00 56.69 C 

-3.800 75.320. 45.276 1.00 56.46 C 

-5.292 74.931 45.380 1.00 54.73 C 

-3.188 75.669 46.662 1.00 55.21 C 

-1.246 72.698 43.845 1.00 57.42 C 

-1.674 71.626 44.293 1.00 52.71 O 

-0.516 72.798 42.723 1.00 61.66 N 

-0.111 71.661 41.863 1.00 60.29 C 

0.209 72.188 40.452 1.00 61.71 C 

0.368 71.132 39.350 1.00 59.57 C 

0.372 71.806 37.973 1.00 60.32 C 

1.078 70.969 36.908 1.00 61.19 C 

0.226 69.892 36.339 1.00 62.02 N 

1,130 70.939 42.422 1.00 59.36 C 

2.147 71.590 42.683 1.00 56.33 O 

I. 054 69.616 42.621 1.00 59.37 N 
2.209 68.859 43.122 1.00 59.06 C 
1.563 67.674 43.855 1.00 58.64 C 
0.253 67.424 43.104 1.00 58.90 C 

-0.110 6B.730 42.404 1.00 59.72 C 

3.083 68.392 41.959 1.00 57.10 C 

2.597 67.736 41.031 1.00 56.75 O 

4.363 68;742 42.030 1.00 56.17 N 

5,362 68.443 40.999 1.00 53.86 C 

6.014 69.767 40.526 1.00 53.92 C 

4.961 70.705 39.933 1.00 56.77 C 

4.096 70^067 38.887 1.00 55.94 C 

7.147 69.526 39.548 1.00 51.49 C 

6.441 67.590 41.621 1.00 50.14 C 

7.025 68.001 42.619 1.00 51.32 O 

6.722 66.420 41.056 1.00 46.26 N 

7.957 65.703 41.411 1.00 46.21 C 

7.756 ' 64.180 41.322 1.00 46.55 C 

9.033 63.436 41.645 1.00 47.93 C 

6.693 63.753 42.283 1.00 49.41 C 

9.150 66.159 40.526 1.00 41.49 C 

9.116 65.971 39.333 1.00 43.55 O 

10.166 66.793 41.115 1.00 37.92 N 

II. 459 67.063 40.446 1.00 39.38 C 
12.157 68.313 41.052 1.00 38.98 C 
13.475 68.571 40.386 1.00 42.09 C 
11.307 69.536 40.928 1,00 42.46 C 
12.465 65.877 40.565 1.00 37.99 C 
12.687 65.372 41.663 1.00 38.01 O 
13.063 65.444 39.445 1.00 38.10 N 
14.171 64.463 39.432 1.00 35.38 C 
13.926 63.441 38.355 1.00 35.52 C 
12.619 62.673 38.325 1.00 35.97 C 
12.766 61.531 37.324 1.00 35.60 C 
12.199 62.167 39.693 1.00 34.85 C 
15.517 65.135 39.127 1.00.34.72 C 
15.652 65.742 38.087 1.00 33.92 O 
16.517 65.033 40.001 1.00 38.68 N 
17.722 65.849 39.824 1.00 39.77 C 
18.182 66.464 41.122 1.00 38.32 C 
19.048 67.671 40.940 1.00 37.02 C 
20.373 67.701 41.316 1.00 39.96 M 
20.879 68.900 41.067 1.00 39.45 C 
19.925 69.653 40.548 1.00 38.20 N 
ie!770 68.906 40,456 1.00 37.78 C 
18.895 65.154 39.115 1.00 44.11 C 
19.109 65.371 37.912 1.00 49.18 O 
19.682 64.342 39.791 1.00 39.04 N 
20.921 63.947 39.138 1.00 38.00 C 
20^737 62.981 37.976 1.00 36.52 C 
19 630 62.517 37.713 1.00 32.66 O 
21.843 62.656 37.306 1.00 37.59 N 
21^850 61.683 36.203 1.00 40.74 C 
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Figure 5 
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Figure 5 
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Figure 5 
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Figure 5 

86.498 19.672 . 1.00 45.41 C 

87.109 18.392 1.00 46.40 C 

B8.300 18.757 1.00 46.13 C 

87.531 17.468 1.00 45.33 C 

85.536 20.825 1.00 44.45 C 
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85.071 21.938 1.00 44.37 N 
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84.663 24.306 1.00 46.00 C 
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85.875 26.401 1.00 48.81 C 
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84.347 23.274 1.00 47.76 C 
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82.070 21.142 1.00 42.64 C 

81.086 20.481 1.00 45.71 0 
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B2.916 19.279 1.00 39.19 C 
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84.806 20.382 1.00 41.48 C 
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81.358 17.924 , .1.00*38.; 48 0 
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80.189 19.990 1.00 39.59 C 
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78.252 18.641 ' 1.00 37; 37 0 
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SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



330/514 



PCT/GB02/04872 



ATOM 


4747 


N ' 


GLN A 324 


38.783 


ATOM 


4749 


CA 


GLN A 324 


39.142 


ATOM 


4751 


CB 


GLN A 324 


39.378 


ATOM 


4754 


CG 


GLN A 324 


38.058 




4757 


CD 


GLN A 324 


38.211 


ATOM 


H IDO 


0E1 


GLN A 324 


37.653 


ATOM 


A1KQ 
H iOH 




gt.n a 324 


38 . 939 


ATOM 


Aid? 
4 f OC 


n 


GLN A 324 


40.299 


ATOM 


4 r 0 J 


U 


GT.N A 324 


40.287 


ATOM 


*i / OH 


N 


GLn a 325 


41.264 


ATOM 


At fid 
H 1 DO 


r**a 


fiT.n a 325 


42.322 


ATOM 


s / 00 


LO 


GT.n A 325 


43.3B9 


ATOM 


1 / / X 


CG 


GLO A 325 


44.199 


ATOM 


All A 
Hit'* 




/2T.n a 325 


44.723 




4775 


0E1 


GLU A 325 


45.136 


ATOM 


4776 


OE2 


GLU A 325 


44.730 


a TOM 


4777 




GLU A 325 


41.702 


a TOM 


4778 




GLU A 325 


42.127 


ATOM 


4779 


u 


GLU A 326 


40.727 


ATOM 


4781 


CA 


GLU A 326 


40.058 


niun 


4783 


CB ' 


GLU A 326 


39.151 


ATOM 


4786 


CG 


GLU A 326 


39.680 


ATOM 


4789 


CD 


GLU A 326 


38.692 


ATOM 


4790 


OBI 


GLU A 326 


38.870 


ATOM 


4791 


0E2 


GLU A 326 


37.747 


ATOM 


4792 


c 


GLU A 326 


39.231 


ATOM 


4793 


o 


GLU A 326 


39.096 




470,4 


N 


ILE A 327 


38.673 


a TDM 

til UM 


4796 


CA 


ILE A 327 


37.868 


a TOM 




CB 


TLE A 327 


37.031 


a TOM 


4800 


CGI 


TLE A 327 


35.808 


ATOM 


4803 


CDl 


ILE A 327 


35.255 


ATOM 


4807 


CG2 


ILE A 327 


36.528 


a TOM 


4811 


r 


ILE A 327 


38.794 


ATOM 


4812 


o 


ILE A 327 


38.494 


ATOM 


HOI J 


N 


GLU A 328 


39.933 


a TOM 


y O J. □ 


CA 


GLU A 328 


40.818 


ATOM 


ARM 
HOI f 


CB 


GLU A 328 


41.978 


a TOM 




CG 


GLU A 328 


41.547 


a TOM 


A QOQ 


CD 


KT.n A 328 


41.939 


ATOM 


4824 


0E1 


GLU A 328 


42.042 


ATOM 




0E2 


GLU A 328 


42.134 


ATOM 


4826 




GLU A 328 


41.395 


ATOM 


4827 


o 


GLU A 328 


41.567 


ATOM 


ZR9P 




ARG A 329 


41.689 


ATOM 


HO JU 




arc A 329 


42.295 


ATOM 


4R39 


CB 


ARC A 329 


42.824 


ATOM 


A AT"* 


CG 


ARG A 329 


43.601 


ATOM 


40JD 


CD 


ARG A 329 


44 .437 


ATOM 


dR41 
HO H X 


KE 


ARG A 329 


43.617 


a TOM 


4643 


CZ 


ARG A 329 


42.789 


ATOM 


4844 


NH1 


ARG A 329 


42.073 


ATOM 


4847 


NH2 


ARG A 329 


42.660 


ATOM 


4850 


C 


ARG A 329 


41.322 


ATOM 


4651 


0 


ARG A 329 


41.640 


ATOM 


4852 


N 


VAL A 330 


40.130 


ATOM 


4854 


CA 


VAL A 330 


39.210 


ATOM 


4856 


CB 


VAL A 330 


38.355 


ATOM 


4858 


CGI 


VAL A 330 


37.335 


ATOM 


4862 


CG2 


VAL A 330 


39.265 


ATOM 


4866 


C 


VAL A 330 


38.307 


ATOM 


4867 


0 


VAL A 330 


38.000 


ATOM 


4868 


N 


ILE A 331 


37.866 


ATOM 


4870 


CA 


ILE A 331 


37.004 


ATOM 


4872 


CB 


ILE A 331. 


35.813 


ATOM 


4874 


CGI ILE A 331 


34.958 


ATOM 


4877 


CDl 


ILE A 331 


34.104 


ATOM 


4881 


CG2 


ILE A 331 


34.991 


ATOM 


4885 


C 


ILE A 331 


37.882 


ATOM 


4886 


0 


ILE A 331 


38.740 


ATOM 


4887 


N 


GLY A 332 


37.731 


ATOM 


4889 


CA 


GLY A 332 


38.594 


ATOM 


4692 


C 


GLY A 332 


38.040 


ATOM 


4693 


0 


GLY A 332 


37.224 


ATOM 


4894 


N 


ARG A 333 


38.503 


ATOM 


4896 


CA 


ARG A 333 


37.792 



Figure 5 

76.291 12.397 1.00 33.29 N 

76.632 11.018 1.00 35.05 C 

78.135 10.851 1.00 33.36 C 

78.895 10.954 1.00 36.13 C 

80.394 11.048 1.00 39.46 C 

81.117 10.225 1.00 45.13 . O 

80.871 12.055 1.00 37.61 N 

75.790 10.464 1,00 37.37 C 

75*500 9.284 1.00 36.64 O 

75.376 11.292 1.00 41.50 N 

74.471 10.810 1.00 45.85 C 

74.147 11.876 1.00 49.45 C 

75.311 12.443 1.00 55.86 C 

75.050 13.894 1.00 63.74 C 

76.032 14.578 1.00 66.43 O 

73.879 14.382 1.00 60.05 O 

73.145 10.340 1.00 41.29 C 

72.570 9.330 1.00 39.43 O 

72.649 11.097 1.00 37.26 N 

71.387 10.759 1.00 36.36 C 
70.942 11.894 1.00 36.25 C 
69.760 12.666 1.00 44.16 C 
69.235 13.699 1.00 45.43 C 
68.098 14.156 1.00 42.07 O 

69.958 14.055 1.00 51.06 O 
71.519 9.484 1.00 33.91 C 
70.579 8.721 1.00 34.9.0 0 
72.703 9.264 1.00 36.11- N 

72.959 8.080 1.00 35.38 C 
74.272 8.236 1.00 34.85 C 
74.072 9.161 1.00 32.43 C . 
75.406 9.769 1.00 27.91 \ C; 
74.727 6.910 1.00 36.52 C; 
72.978 6.858 1.00 36.05 C 
72.297 5.899 1.00 35.24 O <: 
73^676 6.907 1.00 31.70 N! . 
73.769 5.744 1.00 35.70 P ■ 
74.733 5.974 1.00 42.58 C 
76.090 6.530 1.00 55.27 C. 
77.281 5.669 1.00 61.81 k C .... 

78.388 6.262 1.00 63.73 9 
77.115 4.431 1.00 64.09 0 

72.434 5.390 1.00 32.26 C 
72.111 4.232 1.00 39.89 O 
71.657 6.405 1.00 33.18 N 
70.367 6.247 1.00 32.33 C 
69.914 7.594 1.00 37.29 C 
68.609 7.591 1.00 37.86 C 

68.435 .8.850 1.00 40.77 C 
68.429 10.059 1.00 43.88 N 
67.444 10.390 1.00 48.37 ■ C 
67.528 11.507 1.00 53.84 N 
66.373 9.609 1.00 48.46 N 
69.352 5.719 1.00 32.80 C 
68.638 4.798 . 1.00 36.99 O 
69.280 .6.289 1.00 35.10 N 
6B.179 5.964 1.00 34.67 C 
67.883 7.163 1.00 29.56 C 
66.817 6.841 1.00 27.52 C 
67.442 8.312 1.00 33.13 C 
68.450 4.756 1.00 36.91 C 
67.579 3.948 1.00 37.51 O 
69.685 4.706 1.00 40.47 N 
70.234 3.696 1.00 43.36 ' C 
70.939 4.436 1.00 44.90 C 
69.909 5.177 1.00 40.90. C 
70.486 6.285 1.00 42.14 C 
71.787 3.477 1.00 47.75 C 
71.285 3.056 1.00 44.48 C 
71,866 3.704 1.00 49.14 O 
71.58B 1.806 1.00 46.66 N 
72.657 1.341 1.00 52.01 C 

74.022 1.710 1.00 56.70 C 
74.173 2.624 1.00 52.62 0 

75.023 0.974 1.00 61.43 N 
76.272 0.877 1.00 65.02 C 
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Figure 5 

77.433 0.665 1.00 69.20 C 
78.673 1.477 1.00 76.16 C 
79.582 1.773 1.00 82.47 C 
79.825 3.208 1.00 87.57 N 
80.304 3.785 1.00 89.41 C 
80.596 3.062 1.00 87.42 N 
80.494 5.105 1.00 91.93 N 
76.175 -0.275 1.00 64.51 C 
76.914 -0.314 1.00 66.27 0 
75.258 -1.207 1.00 63.43 N 
75.118 -2.373 1.00 62.62 C 
74.293 -3.479 1.00 62.79 C 
75.007 -4.069 1.00 62.48 C . 
74.997 -3.494 1.00 55.74 0 
75.638 -5.220 1.00 64.99 N 
74.564 -1.979 1.00 60.85 C 
75.356 -1.681 1.00 68.90 0 
73.241 -1.933 1.00 54.09 • N 
72.629 -1.665 1.00 48.24 C 
71.116 -1.860 1.00 45.29 C 
70.372 -0.750 1.00 45.12 C 
6B.877 -0.902 1.00 48.01 C 
68.230 0.396 1.00 53.18 N 
67.900 1.183 1.00 56.61 C 
68.125 0.824 1.00 62.37 N 
67.328 2.344 1.00 55.54 N 
72.935 -0.308 1.00 47.83 C 
73.619 0.562 1.00 46.26 O 

72.434 -0.148 1.00 47.11 N 
72.794 0.999 1.00 47.48 C 
73.091 0.573 1.00 50.15 C 
71.905 0.594 1.00 50.68 0 
71.606 1.943 1.00 42.70 C 
70.481 1.505 1.00 42.29 O 
71.858 3.237 1.00 31.47 N 
70.792 4.219 1.00 26.65 C 
71.522 5. 528 1.00 25.01 C 
72.855 5.241 1.00 28.90 C 
73.169 3.876 1.00 31.55 C 
69.855 4.002 1.00 30.64 C 
70.205 3.285 1.00 35.50 0 

68.694 4.628 1.00 31.91 N 
67.721 4.617 1.00 34.56 C 
66.931 3.332 1.00 39.53 C 
65.816 3.386 1.00 48.28 S 
66.801 5.823 1.00 34.88 C 
66.912 6.538 1.00 33.84 O 
65.869 6.046 1.00 35.59 N 
65.153 7.312 1.00 40.47 C 
64.515 7.584 1.00 40.15 C 
65.558 7.776 1.00 43.91 C 
66.827 9.074 1.00 44.24 S 
65.750 10.418 1.00 45.49 C 
64.145 7.423 1.00 43.56 C 
63.902 8.518 1.00 46.39 O 
63.569 6.297 1.00 47.33 N 
62.634 6.298 1.00 50.71 C 
62.102 4.900 1.00 53.74 C 

60.695 4.704 1.00 60.69 C * 
60.197 3.340 1.00 68.40 C 
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63.260 6.822 1.00 45.23 C 
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66.701 6.740 1.00 43.34 C 
66.834 5.253 1.00 42.57 C 
65.898 4.575 1.00-50.82 0 
67.843 4.676 1.00 42.69 0 
65.300 8.712 1.00 41.93 C 
65.674 9.235 1.00 38.54 0 
64.981 9.421 1.00 41.23 N 
65.252 10.844 1.00 39.88 C 
65.312 11.384 1.00 43.34 C 
65.659 12.859 1.00 44.67 C 
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Figure 5 
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Figure 5 

63.918 19.085 1.00 41.27 O 

64.966 18.709 1.00 35.97 N 

64.313 17.440 1.00 35.15 C 

64.596 17.053 1.00 36.18 C 

64.444 15.608 1.00 37.68 C 

63.668 17.817 1.00 37.95 C 

64.772 16.394 1.00 35.49 C 

64.071 15.403 1.00 37.18 0 

65.930 16.638 1.00 32.32 » 

66.493 15.714 1.00 32,02 C 

66.447 16.112 1.00 30.69 C 

67.287 15.681 1.00 30.06 O 

65.474 16.921 1.00 32.23 » 
65.366 17.373 1.00 30.35 C 
64.069 18.147 1.00 33.34 C 
64.011 19.018 1.00 37.16 C 
62.667 19.724 1.00 41.67 C 
62.573 20.575 1.00 43.82 O 
61.715 19.452 1.00 40.54 O 
65.402 16.173 1.00 34.52 C 
66.271 16.068 1.00 34.74 O 

64.475 15.244 1.00 33.78 N 
64.380 14.065 1.00 30.60 C 
63.202 13.227 1.00 29.59 C 
65.650 13.238 1.00 27.90 C 
66.119 12.771 1.00 33.40 O 
66.218 13.060 1.00 28.70 N 
67.378 12.185 1.00 31.96 C . 
67.735 12.036 1.00 35.75..' C . 
69.006 11.242 1.00 38.50< . *C- 
68.978 9.927 1.00 39.16, -.^C, 
69.129 11.039 1.00 41.40. ,, ; C , . , 
6B.563 12.735 1.00 29.39- . ; r.C 
69.213 12.039 1.00 29.69; .y- O 
68.822 14.006 1.00 28.50 .-• w N. 

69.806 14.737 1.00 29.61 ;-C 

69.807 16.192 1.00 31.64 , C. 
69.578 14.683 1.00 29.34, ,.C 

70.518 14.579 1.00 27.63' O 
68.336 14.792 1.00 30.20 . N 
68.049 14.654 1.00 30.81. ./ . ' C 
68.466 13.281 1.00 31.14 ' C 
69.098 13.169 1.00 33.60 O 
68.143 12.241 1.00 31.76 N 
68.560 10.893 1.00 30.44 C 
68.036 9.867 1.00 33.85 C 
66.529 9.703 1.00 34.09 C 
66.052 8.180 1.00 40.71 S 
64.423 8.773 1.00 46.61 C 
70.058 10.778 1.00 31.49 C 
70.601 10.153 1.00 31.80 O 
70.750 11.368 1.00 31.77 K 
72.189 11.159 1.00 31.06 C 
72.768 11.665 1.00 32.92 C 
72.292 10.901 1.00 37.84 C 
73.031 11.338 1.00 42.20 C 
74.273 11.212 1-00 48.38 O 
72.390 11.827 1.00 43.20 O 
72.878 11.841 1.00 28.05 C 
73.795 11.273 1.00 24.94 0 
72.436 13.057 1.00 23.88 N 
73.082 13.851 1.00 23.07 C 
72.502 15.251 1.00 21.11 C 
72.946 16.039 1.00 21.47 C 
72.136 17.327 1.00 17.30 C 
74.482 16.304 1.00 19.21 C 
72.876 13.189 1.00 27.27 C 
73.839 12.946 1.00 28.54 O 
71.614 12.873 1.00 27.35 N 
71.239 12.241 1.00 32.64 C 
69.713 12.098 1.00 33.69 C 
69.276 11.454 1.00 38.12 C 
68.884 10.132 1.00 42.68 C 
68.497 9.534 1.00 44.33 C 

68.519 10.257 1.00 41.55 C 
68.906 11.570 1.00 42.78 C 
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Figure 5 

29.849 69.291 12.163 1.00 38.44 

27.516 71.934 10.869 1.00 28.85 

28.557 72.550 10.629 1.00 26.64 

26.511 71.865 10.003 1.00 26.75 

26.618 72.404 8.646 1.00 25.60 

25.588 71.750 7.718 1.00 24.02 

25.711 70.229 7.637 1.00 29.23 

24.538 69.622 6.884 1.00 30.23 

27.040 69.836 6.977 1.00 32.39 

26.524 73.933 6.572 1.00 24.66 

27.193 74.520 7.743 1.00 25.52 

25.726 74.584 9.422 1.00 26.91 

25.689 76.057 9.415 1.00 29.36 

24.458 76.637 10.128 1.00 30.61 

23.129 76.250 9.533 1.00 26.90 

23.022 75.566 8.351 1.00 30.26 

21.769 75.223 7.843 1.00 35.27 

20.625 75.582 8.517 1.00 32.60 

20.730 76.264 9.692 1.00 28.39 

21.971 76.602 10.189 1.00 24.92 

26.924 76.660 10.088 1.00 32.33 

27.406 77.704 9.661 1.00 39.14 

27.426 76.040 11.151 1.00 30.79 

28.588 76.595 11.837 1.00 30.90 

28.768 75.961 13.229 1.00 31.56 

27.741 76.394 14.285 1.00 35.96 

28.048 75.723 15.583 1.00 34.66 

27.664 77.932 14.491 1.00 37.25 

29.855 76.444 10.969 1.00 29.88 

30.603 77.411 10.788 1.00 21.51 

30.076 75.244 10.420 1.00 29.18 

31.265 74.998 9.587 1.00 29.64 

31.421 73.519 9.186 1.00 30.36 

30.180 72.986 8.699 1.00 34.16 

31.726 72.655 10.413 1.00 31.55 

31.195 75.850 8.355 1.00 28.81 

32.200 76.429 7.964 1.00 32.21 

29.999 75.959 7.771 1.00 28.40 

29.786 76.827 6.615 1.00 27.47 

28.373 76.676 6.022 1.00 27.93 

28.189 75.381 5.421 1.00 23.37 

30.124 78.281 6.937 1.00 29.16 

30.802 78.935 6.157 1.00 32.94 

29.708 78.784 8.094 1.00 30.00 

30.029 80.152 8.459 1.00 29.35 

29.287 80.566 9.734 1.00 32.21 

27.784 -80.697 9.469 1.00 31.56 

26.955 80.922 10.723 1.00 30.73 

29.883 81.897 10.326 1.00 28.30 

31.536 80.345 8.666 1.00 33.41 

32.108 81.337 8.224 1.00. 35.69 

32.187 79.420 9.356 1.00 35.56 

33.603 79.614 9.690 1.00 35.66 

33.990 78.786 10.902 1.00 34.89 

33.335 79.311 12.170 1.00 36.20 

33.499 78.292 13.258 1.00 38.84 

33.936 80.646 12.578 1.00 36.79 

34.517 79.312 8.508 1.00 33.89 

35.628 79.822 8.425 1.00 33.21 

34.034 78.507 7.581 1.00 32.49 

34.745 78.313 6.326 1.00 35.68 

34.046 77.252 5.501 1.00 35.01 

34.571 77.127 4.103 1.00 33.32 

33.744 76.183 3.287 1.00 34.79 

34.281 75.272 2.663 1.00 37.72 

32.423 76.388 3.285 1.00 37.41 

34.808 79.589 5.501 1.00 36.08 

35.740 79.788 4.744 1.00 38.92 

33.799 80.443 5.638 1.00 40.14 

33.636 81.577 4.745 1.00 36.76 

32.214 81.569 4.193 1.00 36.92 

32.004 80.487 3.126 1.00 32.82 

32.269 80.700 1.954 1.00 35.94 

31.522 79.339 3.537 1.00 31.89 

33.970 82.910 5.388 1.00 39.09 

34.328 83.863 4.692 1.00 41.52 
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Figure 5 
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Figure 5 
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Figure 5 

29.708 91.082 15.249 

29.923 89.590 15.542 

29.087 88.564 14.767 

29.554 87.153 15.059 

27.618 88.693 15.075 

29.723 91.349 13.734 

28.676 91.553 13.126 
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31.054 91.508 11.701 

32.148 92.526 11.370 

32.067 94.053 12.323 

31.468 90.190 11.130 

32.281 89.493 11.719 
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29.277 86.107 10.252 
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31.910 89.471 7.101 
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35.985 89i773 8.184 
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35.207 87.077 4.410 
34.751 87.552 3.025 
33.824 88.692 3.307 
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36.685 86.728 4.419 
37.503 87.616 4.589 
37.007 85.457 4.230 
38.375 84.956 4.378 

38.364 83.434 4.663 
37.644 83.142 5.959 
37.730 82.645 3.510 
39.286 85.243 3.154 
38.798 85.177 2.017 
40.514 85.528 3.240 
13.254 69.399 20.011 
11.888 68.813 19.774 
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9.740 67.672 20.185 
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17.374 71.301 21.212 
18.416 71.627 22.231 
16.040 70.621 21.369 
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Figure 5 

15.336 70,467 20.228 1.00 20.63 
15.588 70.137 22.677 1.00 23.60 
14.354 69.445 23.031 1.00 24.55 
13.463 69.168 22.102 1.00 31.77 
12.449 68.557 22.751 1.00 28.30 
11.256 68.083 22.112 1.00 23.38 
13.968 68,988 24.403 1.00 25.34 
14.746 69.134 25.660 1.00 28.53 
12.649 68.362 24.165 1.00 24.97 
11.648 67.689 25.007 1.00 29.38 
12.040 67.205 26.355 1.00 39.94 
11.299 65.972 26.776 1.00 39.12 
10.122 65.814 26.496 1.00 39.32 
11.905 65.153 27.430 1.00 38.31 
57.425 70.240 57.617 1.00 54.43 
56.507 69.110 57.849 1.00 56.73 
56.271 68.528 56.436 1.00 53.60 
56.624 69.568 55.503 1.00 50.78 
57.682 70.411 56.180 1.00 56.04 
57.165 68.075 58.721 1.00 58.10 
58.3B8 68.101 58.819 1.00 62.13 
56,374 67.174 59.307 1.00 58.74 
56.878 66.171 60.250 1.00 59.47 
55.598 65.572 60.850 1.00 60.00 
54.565 65.744 ' 59.789 1.00 59.27 ■ 
54.922 67.034 59.084 1.00 59.30 
57.637 65.126 59.486 1.00 61.31 
57.806 65.343 58.301 1.00 64.88 
58.064 64.040 60.131 1.00 64.11 
58.800 62.960 59.474 1.00 63.30 
59.794 62.275 60.400 1.00 61.87 
60.103 62.805 61.465 1.00 61.56 
60.351 61.136 59.993 1.00 61.48 
61.204 60.368 60.905 1.00 62.51 
61.495 59.058 60.154 1.00 58.71 
60.693 59.106 58.933 1.00 61.28 
60.324 60.542 58.651 1.00 61.30 
62.485 61.129 61.194 1.00 63.89 
62.873 62.013 60.415 1.00 65.39 
63.101 60.801 62.324 1.00 64.96 
64.306 61.479 62.771 1.00 64.37 
64.462 61.328 64.309 1.00 64.76 
63.852 62.459 64^937 1.00 64 . 88 
65.944 61.370 64.790 1.00 65.45 
65.479 60.896 61.992 1.00 62.59 
65.677 59.677 62.005 1.00 61.17 
66.207 61.753 61.273 1.00 61.95 
67.376 61.313 60.515 1.00 62.44 
67.590 62.447 59.503 1.00 62.98 
67.081 63.668 60.188 1.00 62.89 
65.962 63.201 61.100 1.00 63.02 
68.598 61.154 61.408 1.00 64.13 
68.954 62.06B 62.153 1.00 59.59 
69.212 59.977 61.319 1.00 66.86 
70.513 59.706 61.908 1.00 68.73 
70.813 58.211 61.770 1.00 68.51 
69.980 57.289 62.688 1.00 68.44 
70.343 55.858 62.373 1.00 66,02 
70.142 57.552 64.205 1.00 66.36 
71.624 60.574 61.269 1.00 71.80 
71.449 61.086 60.156 1.00 72.79 
72.751 60.750 61.974 1.00 74.33 
73.766 61.752 61.611 1.00 74.39 
75.021 61.271 62.371 1.00 75.81 
74.509 60.548 63.587 1.00 75.55 
73.142 60.024 63.204 1.00 75.72 
74.079 61.938 60.118 1.00 72.91 
73.912 63.056 59.616 1.00 74.58 
74 544 60.892 59.440 1.00 70.18 
74^979 61.022 58.041 1.00 70.53 
76.506 60.779 57.906 1.00 72.77 
76.860 59.290 58.170 1.00 74.43 
77.024 61.268 56.523 1.00 73.14 
74.215 60.113 57.062 1.00 67.69 
74.041 60.454 55.888 1.00 66.86 
73.774 58.958 57.548 1.00 65.10 
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Figure 5 

48.651 46.598 1.00 88.36 C 

47.788 46.626 1.00 87.53 0 

49.743 46.052 1.00 92.27 0 

50.597 47.953 1.00 76.85 C 

50.476 48.877 1.00 76.75 0 

51.759 47.669 1.00 73.15 N 

52.944 48.484 1.00 72.13 C 
54.160 48.102 1.00 71.66 C 
54.768 46.761 1.00 76.38 C 
56.269 46.818 1.00 77.55 C 
53.767 48.064 1.00 71.11 C 
53.311 48.473 1.00 71.34 C 
53.836 49,453 1.00 71.26 0 
52.995 47.390 1.00 69.88 N 
53.237 47.301 1.00 70.37 C 
52.939 $5,874 1.00 73.12 C 
52.050 45.106 1.00 73.89 0 
52.483 48.348 1.00 68.31 C 
53.037 48.870 1.00 58.23 . O 
51.227 48.643 1.00 70.45 N 
50.414 49.588 1.00 71.98 C 

48.945 49.616 1.00 76.11 C 
47.958 49.974 1.00 81.92 C 
47.565 48.739 1.00 85.82 C 
48.152 48.778 1.00 86.39 C 
48.286 47.403 1.00 B7.07 N 
50.991 50.995 1.00 67.60 C 
51.006 51.709 1.00 69.43. 0 
51.480 51.377 1.00 59.69 N 
52.093 52.685- 1.00 55.82; C 
52.033 53.114: 1.00 52.93 C 
52.734 52.226r 1.00 51.20.1 0 
53.522 52.801; 1.00 55.39; C 

54.020 53.906- 1.00 54.81. 0 
54.181 51.676. l.'OO 55.84.1 n 
55.520 51.6791 1,00 50.19; C 
56.173 50.317.. .1.00 49.20 C 
56.749 49.978 1.00 53.14 C 
57.236 48.533 ,1.00 58.13 C 
57.879 50.878 M. 00 52.49- C 
55.465 52.035.1.00 51.35/ C 
56.358 52.711 1.00 51.60 O 
54.432 51.579 1.00 47.22 N 
54.239 52.008 1.00 49.08 C 
53.204 51.117 1.00 48.57 C 
53.778 49.822 1.00 53.30 O 
52.882 51.613 1.00 47.10 C 
53.829 53.468 1.00 49.09 C 
54.152 54.173 1.00 50.67 0 
53.096' 53.978 1.00 50.90 N 
52.821 55.409. 1.00 48.33 C 
51,795 55.724 1.00 44.50 C 
50.392 55.428 1.00 43.30 C 
50.146 55.082 1.00 39.24 O 
49.458 55.559 1.00 46.07 N 
54.091 56.209 1.00 47.97 C. 
54.299 57.230 1.00 52.67 0 
54.965 55.744 1.00 43.86 N 
56.197 56.484 1.00 47.09 C 
56.894 55.994 1.00 46.41 C 
56.132 56.292 1.00 50.59 C 
56.643 55.466 1.00 51.68 C 
56.168 57.769 1.00 53.01 C 
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57.993 57.374 1.00 51.53 O 
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56.398 56.822 1.00 43.76 N 
55.874 57.848 1.00 44.17 C 
54.346 57.869 1.00 46.56 C 
53.591 57.191 1.00 49.93 C 
54.084 55.764 1.00 53.96 C 
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Figure 5 

47.701 53.200 55.116 1.00 

47.763 51.792 55.664 1.00 

50.439 56.434 59.230 1.00 

49.602 56.404 60.135 1.00 

51.666 56.933 59.396 1.00 

52.062 57.538 60.654 1.00 

53.236 56.763 61.392 1.00 

53.701 55.516 60.632 1.00 

54.413 57.678 61.745 1.00 

52.328 59.026 60.588 1.00 

52.103 59.695 61.578 1.00 

52.774 59.592 59.477 1.00 

53.073 61.040 59.505 1.00 

54.521 61.325 59.066 1.00 

55.591 60.726 59.980 1.00 

56.005 61.389 61.154 1.00 

56.982 60.835 61.990 1.00 

57.553 59.622 .61.648 1.00 

58.505 59.051 62.453 1.00 

57.163 58.957 60.490 . 1.00 

56.198 59.506 59.668 1.00 

52.068 -61.895 58.704 1.00 

52.189 63.130 58.620 1.00 

51.055 61.244 58.141 1.00 

50.054 61.940 57.354 1.00 
50.388 61.945 55.877 1.00 

51.244 61.189 55.423 1.00 
49.694 62.789 55.116 1.00 
49.854 62.831 53.666 1.00 
48.520 63.415 53.212 1.00 
48.159 64.352 54.283 1.00 
48.703 63.780 55.565 1.00 
50.986 63.747 53.238 1.00 
51.274 63.807 52.066 1.00 
51.604 64.467 54.159 1.00 
52.666 65.392 53.802 1.00 
52.162 66.842 53.752 1.00 
53.295 67.792 53.416 1.00 

51.055 66.983 52.727 1.00 
53.802 65.301 54.798 1.00 
53.799 66.003 55.801 1.00 
54.783 64.452 54.505 1.00 
55.916 64.266 55.412 1.00 
55.841 62.890 56.048 1.00 
55.805 61.745 55.081 1.00 
54.588 61.166 54.715 1.00 
54.544 60.059 53.857 1.00 
55.724 59.532 53.377 1.00 
56.943 60.104 53.753 1.00 
56.975 61.194 54.605 1.00 
57.305 64.461 54.823 1.00 
57.505 64.238 53.643 1.00 
58.248 64.866 55.682 1.00 
59.646 65.121 55.322 1.00 

60.245 66.246 56.188 1.00 
59.555 67.489 55.952 1.00 
61.697 66.553 55.772 1.00 
60.513 63.860 55.433 1.00 
60.3B2 62.992 56.278 1.00 
61.510 63.774 54.558 1.00 
62.397 62.622 54.486 1.00 
61.928 61.667 53.405 1.00 
61.969 60.214 53.835 1.00 
61.064 60.010 55.042 1.00 
61.548 59.331 52.684 1.00 
63.830 63.016 54.205 1.00 
64.105 64.097 53.701 1.00 
64.749 62.105 54.501 1.00 
66.174 62.406 54.370 1.00 
66.828 62.619 55.744 1.00 
66.547 64.004 56.293 1.00 
65.481 64.230 57.165 1.00 
65.216 65.512 57.651 1.00 
66.022 66.571 57.253 1.00 
65.763 67.828 57.712 1.00 
67.077 66.372 56.387 1.00 



PCT/GB02/04872 



54.10 C 
57.02 N 

46.11 C 

45.35 0 
45.72 N 
43.72 C 
45.18 C 
43.71 C 
46.50 C 
44.24 C 

47.49 0 
42.00 N 
45.60 C 

46.36 C 
52.77 C 

55.20 C 
52.62 C 
53.46 C 
51.41 0 

53.00 C 

50.50 C , 

47.89 C 

46.50 0 
49.06 N 

48.86 C 
50.05 C 

50.41 O 

48.49 N ■< . 

45.87 C 
46.26 - C 

43.56 C- .. . - a- 

45.22 C *: 
43.35 :x 

46.42 'f .0 >■ 
43.05 .. ' N ' 

43.45 I: C X 

42.30 C 

37.68 -v C. ; ; . -* ■ 

46.81 . C . . ' i' 

45.21 fC >:. 

44.95 ..0. ! 

47.54 N 

48.77 C 

50.24 C 

47.32 C 

49.90 C 

45.43 C 

44.94 C 

42.93 C 
43.60 C 
47.75 C 
46.86 0 

48.23 N 
49.54 C 

50.95 C 
53.54 0 
51.12 C 
51 . 62 C 
53.03 0 

53.01 N 

53.33 C 
53.41 C 
56.72 C 

59.62 C 

59.51 C 
53.29 C 

48.63 0 
57.56 N 

58.88 C 

59.94 C 

59.91 C 

57.34 ' C 
59.65 C 
57.40 C 
58.80 0 
56.10 C 



SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



352/514 



PCT/GB02/04872 



ATOM 


8085 


CD2 TYR B 


68 


67 .331 


ATOM 


8087 


C 


TYR B 


68 


66.919 


ATOM 


8088 


O 


TYR B 


68 


67.018 


ATOM 


8089 




PHE B 




67 .386 


ATOM 


8091 


CA 


PHE B 




66 .369 


ATOM 


8093 


CB 


PHE B 


CO 
D7 


68.054 


ATOM 


8096 


CG 


PHE B 


£Q 


OO » O UU 


ATOM 


8097 


CD1 


PHE B 


CQ 


66.962 


ATOM 


8099 


CE1. 


PHE B 


C Q 

by 


65 .866 


ATOM 


8101 


CZ 


PHE B 


£Q 

oy 


64 . 648 


ATOM 


8103 


CE2 


PHE B 


CO 


0*i . OZO 


ATOM 


8105 


CD2 


PHE B 


by 


fiC COT 


ATOM 


8107 


C 


PHE B 


CO 

by 


oy . / uu 


ATOM 


6108 


O 


PHE B 


by 


70 .103 


ATOM 


8109 


N 


GLY B 


70 


7fl 9^«i 


ATOM 


8111 


CA 


GLY B 


70 


71 .523 


ATOM 


8114 


C 


GLY B 


70 


71 1 70 


ATOM 


8115 


O 


GLY B 


/ u 


70 . 475 


ATOM 


8116 


N 


LEU B 


"71 


71 . 643 


ATOM 


8118 


CA 


LEU B 


"71 

/ J. 


71 .464. 


ATOM 


8120 


CB 


LEU B 


"71 


72.803 


ATOM 


8123 


CG 


LEU B 


71 
/ J. 


74 .002 


ATOM 


8125 


CD1 


LEU B 


71 


75.258 


ATOM 


8129 


CD2 


LEU B 


71 


73.705 


ATOM 


8133 


C 


LEU B 


7 1 


70 .405 


ATOM 


8134 


0 


LEU B 


71 


70.039 


ATOM 


8135 


N 


LYS B 


79 


69.932 


ATOM 


8137 


CA 


LYS B 


79 


68 .899 


ATOM 


8139 


CB 


LYS B 


79 


68 . 945 


ATOM 


8142 


CG 


LYS B 


79 


68 .407 


ATOM 


8145 
"814B' 


CD 


LYS B 


79 


69 . 138 


ATOM 


CE 


LYS B 




66 .198 


ATOM 


8151 


■NZ 


LYS B 


79 


67 . 453 


ATOM 


8155 


C 


LYS B 


"79 


67 . 520 


ATOM 


8156 ' 


0 


LYS B 


ft 


fi7 1 1 R 


ATOM 


'B157. 


N 


PRO B 


9 9 


66. 832 


ATOM 


B158' 


CA 


PRO B 


9 9 
/J 


09 . iJ3 


ATOM 


8160; 


CB 


PRO B 


73 


09 .ZOO 


ATOM 


8163 


CG 


PRO B 


99 


66 . 207 


'ATOM 


8166 


CD 


PRO B 


79 


67 .353 


ATOM 


8169 


-c 


PRO B 


99 
f J 


64 .424 


ATOM 


8170 


O 


PRO B 


73 


64 . 461 


ATOM 


8171 


N 


ILE B 


1A 
f H 


63.528 


ATOM 


8173 


CA 


ILE B 


7 A 


62 • 546 


ATOM 


8175 


CB 


ILE B . 


1A 


62 . 911 


ATOM 


8177 


CGI 


ILE B 


7 A 


63.731 


■ ATOM 


8180 


CD1 


ILE B 


I H 


65.089 


ATOM 


8184 


CG2 


ILE B 


7d 


61 . 684 


ATOM 


8188 


C 


ILE B 


74 


61.123 


ATOM 


8189 


0 


ILE B 


74 


60. 806 


ATOM 


8190 


N 


VAL B 


75 


60.264 


ATOM 


8192 


CA 


VAL B 


75 


58 . 846 


ATOM 


8194 


CB 


VAL B 


7^ 


58 .164 


ATOM 


Ol QC 


CGI 


VAL B 


75 


56. 670 


ATOM 


8200 


CG2 


VAL B 


75 


58.784 


ATOM 


8204 


C 


VAL B 


75 


58. 171 


ATOM 




O 


VAL B 


75 


58*214 


ATOM 


tJZUo 


N 


VAL B 


76 


57.567 


ATOM 




CA 


VAL B 


76 


56. 705 


ATOM 


q9 1 n 


CB 


VAL B 


76 


56.867 


ATOM 


0*51 9 


CGI 


VAL B 


76 


55. 926 


ATOM 


091 £ 


CG2 


VAL B 


76 


58.315 


ATOM 


8220 


C 


VAL B 


76 ' 


55.225 


ATOM 


8221 


0 


VAL B 


76 


54.803 


ATOM 


8222 


N 


LEU B 


77 


54.448 


ATOM 


8224 


CA 


LEU B 


77 


53.000 


ATOM 


8226 


CB 


LEU B 


77 


52.661 


ATOM 


8229 


CG 


LEU B 


77 


53.454 


ATOM 


8231 


CD1 LEU B 


77 


52.864 


ATOM 


8235 


CD2 LEO B 


77 


53.492 


ATOM 


8239 


C 


LEU B 


77 


52.278 


ATOM 


8240 


O 


LEU B 


77 


52.599 


ATOM 


8241 


N 


HIS B 


78 


51.290 


ATOM 


8243 


CA 


HIS B 


78 


50.747 


ATOM 


B245 


CB 


HIS B 


78 


51.090 


ATOM 


8246 


CG 


HIS B 


78 


50.870 



. Figure 5 

65.100 55.904 1.00 58.00 C 

61.372 53.554 1.00 56.90 C 

60.200 53.926 1.00 52.83 O 

61.828 52.399 1.00 59.37 N 

61.104 51.620 1.00 62.15 C 

61.199 50.127 1.00 66.18 C 

60.394 49.731 1.00 68.48 C 

59.027 49.575 1.00 72.74 C 

58.277 49.234 1.00 73.87 C 

58.886 49.068 1.00 73.18 C 

60.237 49.239 1.00 73.28 C 

60.988 49.574 1.00 71.75 C 

61.728 51.985 1.00 60.37 C 

62.755 51.431 1.00 58.76 O 

61.113 52.965 1.00 59.60 N 

61.695 53.588 1.00 59.13 C 

63.017 54.231 1.00 56.62 C 
63.053 55.237 1.00 56.39 O 
64.099 53.627 1.00 58.18 N 
65.450 54.158 1.00 60.50 C 
66.211 54.126 1.00 60.58 C 
65.595 54.881 1.00 56.51 C 
66.400 54.640 1.00 53.41 C 
65.474 56.385 1.00 52.28 C 
66.231 53.374 1.00 64.26 C 
67.349 53.765 1.00 64.74 O 
65.647 52.265 1.00 64.57 N 
66.248 51.435 1.00 62.31 C 
65.659 50.018 1.00 64.58 C 
66.591 48.898 1.00 65.46 C 
66.381 47.553 1.00 66.10 C 
66.505 46.353 1.00 66.80 C 
67.793 46.361 1.00 67.18 N 

66.018 52.081 . 1.00 61.67 C 
64.874 52.32B 1.00 55.55 O 
67.113 52.405 1.00 61.12 N . 
67.063 52.867 1.00 60.60 C 
68.391 53.611 1.00 60.05 C 
69.319 52.903 1.00 59.35 C 
68.494 52.416 1.00 60.56 C 
66.972 51.718 1.00 60.73 C 
67.B15 50.811 1.00 62.12 O 
65.980 51.783 1.00 60.24 N 
65.683 50.728 1.00 57.67 C 
64.327 50.037 1.00 58.41 C 
64.574 48.779 1.00 61.62 C 
65.134 49.033 1.00 63.47 C 
63.524 49.619 1.00 56.66 C 
65.620 51.303 1.00 54.66 C 
64.736 52.105 1.00 51.14 O 
66.541 50.876 1.00 49.32 N 
66.452 51.189 1.00 47.24 C 
67.817 51.075 1.00 48.31 C 
67.695 51.339 1.00 51.13 C 
68.815 52.037 1.00 47.75 C 
65.468 50.238 1.00 45.28 C 
65.645 49.029 1.00 46.01 0 
64.428 50.801 1.00 44.39 N 
63.475 50.084 1.00 41.92 C 
62.041 50.673 1.00 40.90 C 
61.047 50.004 1.00 41.02 C 
61.563 50.548 1.00 40.28 C 
63.863 50.196 1.00 41.32 C 
64.384 51.219 1.00 4'4.48 O 
63.602 49.145 1.00 41.22 N 
63.814 49.126 1.00 3B.54 C 
64.788 48.018 1.00 37.52 C 
66.086 47.974 1.00 39.76 C 
67.038 46.930 1.00 39.07 C 
66.763 49.346 1.00 43.97 C 
62.491 48.855 1.00 40.89 C 
61.810 47.897 1.00 42.81 O 
62.119 49.656 1.00 41.87 N 
60.772 49.533 1.00 43.43 C 
59.954 50.771 1.00 39.94 C 
58.494 50.594 1.00 34.21 C 
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Figure 5 

49.827 57.824 51.199 1.00 39.96- 

49.857 56.550 50.840 1.00 37.49 

50.892 56.369 50.035 1.00 38.17 . 

51.536 57.573 49.857 1.00 35.66 

49.239 60.698 49.179 1.00 49.50 

48.837 59.935 48.292 1.00 50.06 

48.383 61.482 49.810 1.00 52.24 

46.986 61.451 49.366 1.00 55.16 

46.750 61.780 47.874 1.00 51.62 

47.556 62.494 47.265 1.00 52.73 

45.650 61.296 47.282 1.00 45.31 

45.121 61.963 46.079 1.00 45.27 

43.817 61.325 45.534 1.00 47.61 

43.112 62.261 44.536 1.00 49.92 

43.477 62.278 43.186 1.00 50.04 

42.870 63.155 42.270 1.00 43.21 

41.916 64.043 42.708 1.00 44.61 

41.337 64.897 41.801 1.00 39.29 

41.540 64.065 44.049 1.00 46.47 

42.140 63.186 44.955 1.00 49.17 

44.844 63.449 4-6.354 1.00 43.35 

45.136 64.297 45.519 1.00 36.53 

44.236 63.751 47.504 1.00 43.44 

43.877 65.126 47.835 1.00 46.28 

43.038 .65.219 49.128 1.00 48.63 

41.583 64.700 49.042 1.00 54.17 

40.690 65.388 47.970 1.00 59.24 
39.750 64.723 47.448 1.00 54.16 
40.906 66.590 47.637 1.00 59.01 
45.124 66.004 47.904 1.00 45.86 
45.123 67.112 47.387 1.00 43.46 
46.199 65.490 48.496 1.00 49.51 
47.467 66.244 48.606 1,00 48.70 
48.371 65.586 49.638 1.00 48.45 
48.210 66.392 47.257 1.00 48.64 
48.783 67.446 46.953 1.00 41.19 
46.184 65.341 46.446 1.00 47.33 
48.800 65.401 45.132 1.00 48.02 
48.851 64.014 44.471 1.00 50.42 
49.353 64.113 43.057 1.00 50.55 
49,752 63.074 45.262 1.00 51.79 
48.020 66.375 44.256 1.00 46.38 
48.621 67.175 43.559 1.00 46.10 

46.691 66.316 44.320 1.00 46.25 
45.810 67.185 43.520 1.00 49.65 
44.320 66.731 43.627 1.00 54.96 
43.208 67.812 43.74B 1.00 56.68 
42.840 68.424 42.404 1.00 61.56 
41.537 69.251 42.474 1.00 64.54 
41.552 70.262 43.577 1.00 62.32 
45.980 68.628 43.921 1.00 47.57 
46.148 69.499 43.085 1.00 54.13 
45.975 68.880 45.212 1.00 47.57 
46.139 70.229 45.706 1.00 48.70 
46.006 70.248 47.220 1.00 51.82 
45.873 71.641 47.813 1.00 58.25 
45.554 71.607 49.297 1.00 64.65 
44.759 70.720 49.724 1.00 61.60 
46.103 72.475 50.023 1.00 70.15 
47.478 70.813 45.299 1.00 48.76 
47.590 72.004 45.037 1.00 54.33 
48.512 69.993 45.254 1.00 48.14 
49.834 70.527 44.971 1.00 49.19 
50.904 69.614 45.527 1.00 50.74 
50.022 70.716 43.473 1.00 46.53 
50.439 71.768 43.034 1.00 44.32 
49.693 69.694 42.697 1.00 46.16 
50.053 69.655 41.281 1.00 46.76 
50.064 68.197 40.782 1.00 44.90 
51 378 67.392 40.650 1.00 45.66 
52^79 67.999 41.355 1.00 46.69 
5l!l74 66.000 41.156 1.00 42.65 
49097 70.511 40.430 1.00 45.37 
49^460 71.035 39.382 1.00 43.94 
47 867 70.637 40.888 1.00 46.03 
46i881 71.387 40.155 1.00 45.64 
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Figure 5 
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44.939 65.293 12.283 

45.763 66.396 12.589 

49.826 68.792 12.994 

49.881 69.745 12.238 

50.631 68.685 14.037 

51.556 69.733 14.418 

52.778 69.939 13.527 

53.551 70.868 13.806 

52.974 69.120 12.482 

54.105 69.347 11.568 
55.441 69.034 12.264 
55.650 67.600 12.605 
56.773 67.371 13.979 
56.223 65.671 14.396 
54.053 68.648 10.196 

54.353 67.443 10.053 
53.719 69.448 9.186 
53.718 68.997 7.815 
52.461 68.208 7.535 
51.559 68.144 8.362 
52.446 67.569 6.373 
51.267 66.885 5.849 
51.313 66.867 4.302 
51.938 68.144 3.641 

51.345 68.485 2.250 
52.037 67.736 1.095 
51.378 67.865 -0.242 
51.114 65.454 6.385 
50.130 64.784 6.064 
52.091 64.986 7.171 
52.019 63.676 7.834 
53.358 62.957 7.696 
53.275 61.435 7.623 
54.441 60.800 6.841 
54.924 59.555 7.450 
54.296 58.380 7.386 
54.840 57.321 7.974 
53.125 58.252 6.744 
51.644 63.809 9.319 
52.277 64.546 10.071 
50.603 63.093 9.733 
50.152 63.127 11.126 
48.636 62.777 11.223 
48.323 61.417 10.940 
51.008 62.214 12.026 
51.664 61.296 11.544 
51.019 62.464 13.331 
51.642 61.514 14.254 
51.526 61.963 15.739 

52.106 63.359 15.987 
53.544 63.477 15.678 

52.159 60.906 16.662 
50.929 60.155 14.110 
51.550 59.111 14.160 
49.624 60.169 13.924 
48.877 58.933 13.922 

47.354 59.189 13.998 
46.497 57.945 13.749 
45.025 58.093 14.105 
44.546 59.226 14.352 
44.332 57.053 14.135 
49.256 58.090 12.728 

49.160 56.878 12.799 
49.698 58.710 11.641 
50.132 57.948 10.468 
50.384 58.845 9.250 
49.094 59.207 8.509 

48.346 58.277 8.101 
48.749 60.400 8.282 
51.407 57.235 10.817 
51.504 56.013 10.673 
52.376 58.023 11.285 
53.658 57.525 11.778 
54.3B2 58.627 12.510 
54.954 59.730 11.624 
55.770 60.707 12.452 



i nn ti fin 


£ 


i nn an 




i nn o.q co 

X . UU J3> OJ 


L. 


t nn oo AO 
X . UU J / . ic 


u 


i nn as. tc 
X » UU 45. '5 


u 
W 


t nn cn co 
l.UU dU.OJ 




i nn ce no 
1 . UU jo. \JC 




i nn co i o 
X . UU J / . 1 f 


U 


i nn CO a Q 
l.UU Do, fly 


N 


i nn cn ai 
l.UU oU.41 




1 nn cn ci 
1.00 bU.Dl 


C 


i n n co ci 
1.00 Dy.bl 


C 


i n n co i o 


S 


i nn cn oo 
X . UU jU . o r 


C 


1 nn ci oc 
1.00 bi.ya 


c 


i nn ci q? 
l.UU bl.oz 


0 


i nn cn oi 
1 , UU DU . y / 


N 


n nn co i o 
l.UU o^.lo 




i nn co vie 
l.UU bJ.flo 


C 


i nn ci do 
1 . UU bl . a y 




i nn cc co 
X . UU DO • 3 / 


VJ 


«i nn fifl 11 

I ■ UU DO . J J, 




1 nn oi an 

l.UU / X . 27U 


Q 


n nn 74 4c 


Q 


1 nn *7fi nc. 

X.UU 'DtUJ 


Q 


1 nn os no 

X.UU f D ,\J 1 




i nn 71 0,4 

X . UU / X . 3^* 


N 


i nn cc R7 
X.UU oo «o# 


n 

v.. 


1 nn cc qc 

X.UU 03.33 


n 


"i nn ci is 

X.UU O J. « t 0 


N 


1 nn cc ic • . 


Q 


1 nn cp- O.Q- 


Q 


1 nn ci -po > > 

X . UU , OX . D£ ( . 


n 
V. 


1 nn cc 4 c 

X.UU OjiSD; ; 


r 


i nn cc ,10- . ■ 

X.UU .DO. 13 • • *, f. 


H 


1 nn ca r/i 

X . UU OH .,0* ->•- 


p 


1 nn co aa 1 "> - 


w 

H 


1 nn ca . 00— « 
1 . UU DO ..J 1 • 




1- nn a q cn 

x . uu . «* 0 . o u.». ■■ ... 


p 


1 n n ji *3 0 a ■ •« 
1 . UU' . ofl .. ♦ 


p. 


1 nn is.. * -.f 
l.UU, 9J.. <o Vi ;',i' 


M 
a 


1 nn 00 00 
X.UU Jr. y<£ 


p 


1 nn oc dc 

X . UU JD.gu 


p 
\t 


i nn on 1 a 
X • UU to. /4 


u 


1 nn oc on 

X.UU JO. JU 


p 


i nn 0.0 oq 
l.UU JZ.Jb 


n 

U 


1 nn oc 10 
l.UU Jj.li 


VI 

W 


i nn oc 00 
X . UU JO . 


p 


1 nn 00 oc 
1 .UU J / . /D 


p 


1 nn 00 

1 . UU J / . fl X 


p 


1 nn iin co 
1 . UU 4U . do 


C 


i nn ofi n 

X . UU JO . x 1 


p 


1 nn oc i 0 
1 . UU jj . 1 J 


c 


1 nn An oi 
X . UU H u . / 1 


p 
u 


i nn 97 

X . UU Oc . £ f 




1 nn in C7 

x.UU J4 . 9 f 




1 nn "\a Q7 

X . UU Jl* • u 1 




1 00 36 94 


c 


1 00 39 85 


c 


1.00 44.43 


0 


1 00 43 56 


0 


1.00 33.02 


c 


1.00 34.43 


0 


1.00 37.22 


N 


1.00 4?. 00 


c 


1.00 46.83 


c 


1.00 57.73 


c 


1.00 60.37 


0 


1.00 67.82 


0 


1.00 38.42 


c 


1.00 40.11 


0 


1.00 33.53 


N 


1.00 28.97 


c 


1.00 28.17 


c 


1.00 25.88 


c 


1.00 30.70 


c 



SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



ATOM 


9317 


NE 


ARG 


B 


144 


ATOM 


9319 


CZ 


ARG 


B 


144 


ATOM 


9320 


NH1 


ARG 


B 


144 


ATOM 


9323 


NH2 


ARG 


B 


144 


ATOM 


9326 


c 


ARG 


B 


144 


ATOM 


9327 


0 


ARG 


B 


144 


ATOM 


9328 


N 


VAL 


B 


145 


ATOM 


9330 


CA 


VAL 


B 


145 


ATOM 


9332 


CB 


VAL 


B 


145 


ATOM 


9334 


CGI 


VAL 


B 


145 


ATOM 


9338 


CG2 


VAL 


B 


145 


ATOM 


9342 


c 


VAL 


B 


145 


ATOM 


9343 


0 


VAL 


B 


145 


ATOM 


9344 


N 


GLN 


B 


146 


ATOM 


9346 


CA 


GLN 


B 


146 


ATOM 


9348 


CB 


GLN 


B 


146 


ATOM 


9351 


CG • 


GLN 


B 


146 


ATOM 


9354 


CD 


GLN 


B 


146 


ATOM 


9355 


OE1 


GLN 


B 


146 


ATOM 


9356 


NE2 


GLN 


B 


146 


ATOM 


9359 


c 


GLN 


B 


146 


ATOM 


9360 


0 


GLN 


B 


146 


ATOM 


9361 


N 


GLU 


B 


147 


ATOM 


9363 


CA 


GLU 


B 


147 


ATOM 


9365 


CB 


GLU 


B 


147 


ATOM 


9368 


CG 


GLU 


B 


147 


ATOM 


9371 


CD 


GLU 


B 


147 


ATOM 


9372 


OE1 


GLU 


B 


147 


ATOM 


9373 


OE2 


GLU 


B 


147 


ATOM 


9374 


C 


GLU 


B 


147 


ATOM 


9375 


0 


GLU 


B 


147 


ATOM 


9376 


N 


GLU 


B 


148 


ATOM 


9378 


CA 


GLU 


B 


148 


ATOM 


9380 


CB 


GLU 


B 


148 


ATOM 


9383 


CG 


GLU 


B 


148 


ATOM 


9386 


CD 


GLU 


B 


148 


ATOM 


9387 


OEl 


GLU 


B 


148 


ATOM 


9388 


OE2 


GLU 


B 


148 


. ATOM 


9389 


c 


GLU 


B 


148 


ATOM 


9390 


0 


GLU 


B 


148 


ATOM 


9391 


N 


ALA 


B 


149 


ATOM 


9393 


CA 


ALA 


B 


149 


ATOM 


9395 


CB 


ALA 


B 


149 


ATOM 


9399 


c 


ALA 


B 


149 


ATOM 


9400 


o 


ALA 


B 


149 


ATOM 


9401 


N 


ARG 


B 


150 


ATOM 


9403 


CA 


ARG 


B 


150 


ATOM 


9405 


CB 


ARG 


B 


150 


ATOM 


9408 


CG 


ARG 


B 


150 


ATOM 


9411 


CD 


ARG 


B 


150 


ATOM 


9414 


NE 


ARG 


B 


150 


ATOM 


9416 


CZ 


ARG 


B 


150 


ATOM 


9417 


NH1 


ARG 


B 


150 


ATOM 


9420 


NH2 


ARG 


B 


150 


ATOM 


9423 


c 


ARG 


B 


150 


ATOM 


9424 


0 


ARG 


B 


150 


ATOM 


9425 


N 


CYS 


B 


151 


ATOM 


9427 


CA 


CYS 


B 


151 


ATOM 


9429 


CB 


CYS 


B 


151 


ATOM 


9432 


SG 


CYS 


B 


151 


ATOM 


9433 


C 


CYS 


B 


151 


ATOM 


9434 


O 


CYS 


B 


151 


ATOM 


9435 ' 


N 


LEU 


B 


152 


ATOM 


9437 


CA 


LEU 


B 


152 


ATOM 


9439 


CB 


LEU 


B 


152 


ATOM 


9442 


CG 


LEU 


B 


152 


ATOM 


9444 


CDl 


LEU 


B 


152 


ATOM 


9448 


CD2 


LEU 


B 


152 


ATOM 


9452 


C 


LEU 


B 


152 


ATOM 


9453 


0 


LEU 


B 


152 


ATOM 


9454 


N 


VAL 


B 


153 


ATOM 


9456 


CA 


VAL 


B 


153 


ATOM 


9458 


CB 


VAL 


B 


153 


ATOM 


94 60 


CGI 


VAL 


B 


153 


ATOM 


9464 


CG2 


VAL 


B 


153 


ATOM 


9468 


C 


VAL 


B 


153 



360/514 



Figure 5 

55.701 62.084 11.984 

56.326 62.534 10.918 

57.059 61.703 10.192 

56.211 63.805 10.567 

53.485 56.317 12.701 

54.121 55.299 12.505 
52.574 56.416 13.665 
52.345 55.340 14.593 
51.604 55.819 15.837 
51.306 54.676 16.760 
52.429 56.839 16.559 
51.618 54.206 13.936 
51.844 53.043 14.290 
50.744 54.508 12.982 
50.059 53.435 12.230 
48.923 53.975 11.384 
47.680 54.253 12.209 
46.565 54.834 11.394 
45.550 54.177 11.201 
46.754 56.049 10.883 
51.023 52.655 11.357 
50.849 51.452 11.194 
52.051 53.325 10.830 
53.032 52.657 9.981 
53.937 53.654 9.235 
54.961 52.968 8.318 
56.071 53.895 7.797 
56.244 55.016 8.326 
56.797 53.491 6.851 
53.870 51.704 10.835 
54.036 50,527 10.489 
54.372 52.222 11.953 
55.138 51.422 12.894 
55.559 52.268 14.095 
56.753 51.699 14.832 
57.947 51.495 13.918 
58.277 52.453 13.156 
58.540 50.388 13.963 
54.339 50.223 13.382 
54.854 49.132 13.544 
53.059 50.434 13.593 
52.183 49.372 14.035 
50.792 49.919 14.228 
52.168 48.181 13.066 
52.093 47.028 13.501 
52.231 48.429 11.759 
52.189 47.287 10.843 
51.397 47.558 9.545 
51.954 48.548 B.566 
51.516 48.303 7;102 
52.660 48.453 6.199 
53.187 49.625 5.834 
54.241 49.656 5.023 
52.665 50.775 6.267 
53.569 46.657 10.624 
53.671 45.455 10.409 
54.623 47.455 10.755 
55.972 46.934 10.741 
56.976 48.064 10.770 
56.862 49.090 9.324 
56.265 46.083 11.943 
57.156 45.248 11.886 
55.578 46.353 13.047 
55.758 45.625 14.285 
55.073 46.383 15.422 
55.385 45.957 16.857 
54.762 46.954 17.825 
54.876 44.576 17.166 

55.122 44.255 14.105 
55.663 43.246 14.525 
53.957 44.247 13.467 
53.224 43.019 13.182 
51.830 43.329 12.576 
51.182 42.074 12.024 
50.926 43.969 13.610 
54.010 42.175 12.189 



1.00 36.88 
1.00 39.36 
1.00 42.29 
1.00 39.49 
1.00 30.36 
1.00' 29.10 
l.OO 28.03 
1.00 24.69 
1.00 23.14 
1.00 20.62 
1.00 24.00 
1.00 29.04 
1.00 32.30 
1.00 33.11 
1.00 32.04 
1.00 31.48 
1.00 37.50 
1.00 38.80 
1.00 43.89 
1.00 36.94 
1.00 32.36 
1.00 33.03 
1.00 33.23 
1.00 36.76 
1.00 38.53 
1.00 41.39 
1.00 45.64 
1.00 46.78 
1.00 48.95 
1.00 35.60 
1.00 31.38 
1.00 30.23 
1.00 31.41 
1.00 32.38 
1.00 33.68 
1.00 38.00 
1.00 37.51 
1.00 33.06 
1.00 30.12 
1.00 28.18 
1.00 33.45 
1.00 34.62" 
1.00 34.50 
1.00 36.19 
1.00 30.78 
1.00 39.61 
1.00 44.05 
1.00 45.45 
1.00 50.36 
1.00 52.98 
1.00 56.92 
1.00 60.47 
1.00 62.22 
1.00 61.13 
1.00 44.70 
1.00 47.87 
1.00 41.78 
1.00 40.86 
1.00 41.98 
1.00 47.07 
1.00 43.60 
1.00 50.81 
1.00 43.76 
1.00 44.36 
1.00 46.67 
1.00 50.64 
1.00 54.44 
1.00 50.97 
1.00 43.27 
1.00 35.31 
1.00 46.77 
1.00 48.40 
1.00 46.17 
1.00 43.13 
1.00 45.13 
1.00 51.42 



SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



PCT/GB02/04872 



361/514 



Figure 5 



ATOM 


9469 


0 


VAL 


B 


153 


54.082 


40.974 


12.324 


1.00 


52.52 


0 








ATOM 


9470 


N 


GLU 


B 


154 


54.589 


42.823 


11.184 


1.00 


57.00 


N 








ATOM 


9472 


CA 


GLU 


B 


154 


55.427 


42.148 


10.202 


1.00 


58.60 


C 








ATOM 


9474 


CB 


GLU 


B 


154 


55.874 


43.122 


9.067 


1.00 


62.55 


C 








ATOM 


79H 1 1 


CG 


GLU 


B 


154 


54.859 


43.259 


7 . 906 


1.00 


66.72 


c 








ATOM 


9480 


CTi 


GLU 


B 


154 


54.905 


44.602 


7.135 


1.00 


72.72 


C 












OE1 


GLU 


B 


154 


53.896 


44 . 959 


6. 468 


1.00 


76.67 


o 








Aiun 


94*62 


OE2 


GLU 


B 


154 


55 . 935 


45.315 


7.166 


1.00 


71.91 


o 








aiuh 


7 4 0 J 


o 

V* 


GLU 




154 


56. 607 


41.481 


10. 929 


1.00 


55.96 


c 








AlUM 


a a 

7909 


Oj 


or.n 

VJJJU 


□ 
o 


154 


56. 969 


40 .379 


10.578 


1.00 


60.56 


o 








aiuh 






GLU 




155 


57.133 


42.106 


11 . 978 


1.00 


53.69 


N 








ATOM 




OA 


oT.n 


Q 


155 


58 .260 


41.543 


12 .732 


1.00 


53 .73 


c 








ATOM 


7907 




GLU 




155 


59. 065 


42 . 645 


13. 420 


1.00 


51.29 


c 








ATOM 


QA Q9 
79 71 


CG 


GLU 


B 


155 


60. 146 


43. 155 


12 . 486 


1.00 


58.97 


c 








ATOM 


7973 


on 




B 


155 


61 . 487 


42 . 407 


12 .597 


1.00 


62.29 


c 








ATOM 


9496 




OT»n 


B 


155 


62.014 


42.030 


11.534 


1.00 


63.39 


o 








ATOM 


Q*7 


VJE>£ 


\aLtVJ 


B 


155 


62.029 


42.212- 


13.716 


1.00 


64. 60 


o 








ATOM 


9498 




GLU 


B 


155 


57.901 


40.441 


13.731 


1.00 


52.03 


c 








ATOM 


9499 


Q 


GLU 


B 


155 


58.744 


39.606 


14 .071 


1.00 


52.47 


0 








ATOM 


9500 




LEU 


B 


156 


56.665 


40.430 


14 .203 


1.00 


50.66 


N 








ATOM 


9502 


CA 


LEU 


B 


156 


56.225 


39.358 


15.072 


1.00 


50.62 


c 








ATOM 


9504 


CB 


LEU 


B 


156 


54.974 


39.756 


15.B74 


1.00 


49.09 


c 








ATOM 


9507 


CG 


LEU 


B 


156 


55.118 


40.984 


16.785 


1.00 


48.78 


c 








ATOM 


9509 


CDl 


LEU 


B 


156 


53.795 


41.305 


17.454 


1.00 


49.37 


c 








ATOM 


9513 


CD2 


LEU 


B 


156 


56.196 


40.839 


17.846 


1.00 


49.61 


C 








ATOM 


9517 


c 


LEU 


B 


156 


55.972 


38.155 


14.184 


i.oo 


49.95 


c 








ATOM 


9518 


o 


LEU 


B 


156 


56.234 


37.021 


14.568 


1.00 


49.39 


0 








ATOM 


9519 


N 


ARG 


B 


157 


55.500 


38.418 


12.974 


1.00 


51.43 


N 








ATOM 


9521 


CA 


ARG 


B 


157 


55.271 


37.370 


11.989 


1.00 


53.04 


C 








ATOM 


9523 


CB 


ARG 


B 


157 


54 .772 


37.984 


10.690 


1.00 


50.62 


c 








ATOM 


9526 


CG ' 


ARG 


B 


157 


53.512 


37.371 


10.166 


1.00 


50.34 


c 








ATOM 


9529 


CD 


ARG 


B 


157 


53.020 


38.074 


8.903 


1.00 


54.97 


c 


• .* \ I X i 






ATOM 


9532 


NE 


ARG. 


B 


157 


51 .922 


39.012 


9.144 


1.00 


54.25 


N 






I- 


ATOM 


9534 


CZ** 


ARG 


B 


157 


50. 688 


38.642 


9.503 


1.00 


53.55 


c 


:\T<'> 


" 




ATOM 


9535 


NHl 


ARG B 


157 


50.381 


37.356 


9.707 


1.00 


53.02 


N 


' \ V; • 




t '~ 


a TOM 


73 JO 


NH2 


ARG 


B 


157 


49.763 


39.567 


9.700 


1.00 


50.07 


N 


■ n,.' 


j': \. 


• s 


ATOM 
AlUn 


7391 


r» 


ARG 


B 


157 


56.551 


36.561 


11.723 


1.00 


55.90 


C 








ATOM 


73 9Z 


' Oj ' 


ARG 


B 


157 


56.510 


35.342 


11.591 


1.00 


56.09 


0 




• ' *J. 




a TOM 

Aiun 


739 J 


"W 


LYS 


.B 


158 


57.681 


37.259 


11. 670 


1.00 


59.26 


N 




i *J . 




AlUM 




■ OA- - 


LYS 


B 


158 


58 . 986 


36. 653 


11.411 


1.00 


60.18 


c 








aium 






LYS 


B 


158 


60.008 


37.742 


11.055 


1.00 


60.07 


c 




■;.;* 




ATOM 


733U 


om 


LYS 


B 


158 


59 . 794 


38.310 


9. 669 


1.00 


60.96 


c 








ATOM 


733J 


on 


LYS 


B 


158 


60 . 481 


39. 648 


9.470 


1.00 


64.67 


c 








ATOM 

Aiun 


7330 


CE 


LYS 


B 


158 


60.075 


40.267 


8.121 


1.00 


67.18 


c 








ATOM 

Aiun 


7337 


NZ 


LYS 


B 


158 


60.945 


41.423 


7.766 


1.00 


67.52 


N 








ATOM 


7303 


r» 
I- 


LYS 


B 


•L30 


59 . 532 


35.797 


12.558 


1.00 


61.17 


c 








ATOM 


Q c cn 
7309 


Oj 


LYS 


B 


158 


60. 516 


35.094 


12.366 


1.00 


65.39 


0 








ATOM 


7 3 03 


• W 


THR 


B 


J.3 7 


58 . 924 


35. 845 


13.741 


1.00 


59.80 


N 








ATOM 


73 0 / 


OA 


THR 


B 




59 . 310 


34.917 


14 .807 


1. 00 


57.38 


C 








ATOM 
A1UM 


7307 


CB 


THR 


B 


159 


58 . 875 


35.409 


16.206 


1.00 


57.65 


C 








ATOM 


9571 


OX51 


THR 


B 


159 


57 . 446 


35.516 


16.276 


1.00 


56.26 


0 








ATOM 


9573 


CG2 


THR 


B 


159 


59. 399 


36.810 


16.485 


1.00 


55.57 


c 








ATOM 


9577 




THR 


B 


159 


58 .718 


33.548 


14.547 


1.00 


57.78 


c 








ATOM 


9576 


o 


THR 


B 


159 


58.907 


32.640 


15.34 6 


1.00 


56.78 


0 








ATOM 


9579 


N 


LYS 


B 


160 


57.976 


33.423 


13.449 


1.00 


58.54 


N 








ATOM 


9581 


CA 


LYS 


B 


160 


57.531 


32.137 


12.917 


1.00 


61.50 


c 








ATOM 


9583 


CB 


LYS 


B 


160 


58.751 


31.333 


12.433 


1.00 


65.46 


c 








ATOM 


9586 


CG 


LYS 


B 


160 


59.220 


31.689 


11.014 


1.00 


69.73 


c 








ATOM 


9589 


CD 


LYS 


B 


160 


60.637 


31.145 


10.710 


1.00 


72.41 


c 








ATOM 


9592 


CE 


LYS 


B 


160 


60.639 


29.664 


10.292 


1.00 


73.83 


c 








ATOM 


9595 


NZ 


LYS 


B 


160 


62.017 


29.122 


9.981 


1.00 


74.41 


N 








ATOM 


9599 


c 


LYS 


B 


160 


56. 663 


31.318 


13.896 


1.00 


60.43 


C 








ATOM 


9600 


0 


LYS 


B 


160 


56.540 


30.105 


13.766 


1.00 


61.44 


0 








ATOM 


9601 


N 


ALA 


B 


161 


56.039 


32.000 


14.858 


1.00 


60.02 


N 








ATOM 


9603 


CA 


ALA 


B 


161 


55.159 


31.374 


15.854 


1.00 


55.48 


C 








ATOM 


9605 


CB 


ALA 


B 


161 


54.086 


30.547 


15.190 


1.00 


53.11 


C 








ATOM 


9609 


C 


ALA 


B 


161 


55.920 


30.542 


16.870 


1.00 


55.99 


C 








ATOM 


9610 


O 


ALA 


B 


161 


55.332 


29.741 


17.575 


1.00 


58.27 


0 








ATOM 


9611 


N 


SER 


B 


162 


57.226 


30.763 


16.960 


1.00 


58.85 


N 








ATOM 


9613 


CA 


SER 


B 


162 


58.090 


30.086 


17 . 922 


1.00 


58.89 


C 








ATOM 


9615 


CB 


SER 


B 


162 


59.477 


29.840 


17.309 


1.00 


60.03 


c 








ATOM 


9616 


OG 


SER 


B 


162 


60.297 


30.997 


17.449 


1.00 


62.56 


0 








ATOM 


9620 


.C 


SER 


B 


162 


58.248 


30.959 


19.158 


1.00 


54.99 


c 








ATOM 


9621 


O 


SER 


B 


162 


58.069 


32.162 


19.074 


1.00 


56.86 


0 








ATOM 


9622 


N 


PRO 


B 


163 


58.640 


30.364 


20.266 


1.00 


54.45 


N 








ATOM 


9623 


CA 


PRO 


B 


163 


58.812 


31.102 


21.533 


1.00 


52.00 


c 









SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



ATOM 


9625 


CB 


rm/ o J. dj 


ATOM 


9628 


CG 


PWJ o lOJ 


ATOM 


9631 


CD 


boa n i c "3 
PRO o lOJ 


ATOM 


9634 


C 


nnft n 1 CI 

rKU a ±oj 


ATOM 


9635 


0 


rKU D IOj 


ATOM 


9636 


N 


pvc T3 1 &A 
UIO D 104 


ATOM 


9638 


CA 


nvQ n 1 &A 

Lis a ixjh 


ATOM 


9640 


CB 


»VC O 1 C%A 

LIS D 104 


ATOM 


9643 


SG 


nvC n 1 (\A 
WIS o 1D4 


ATOM 


9644 


C 


r»vc D 1 GA 
ClS B 104 


ATOM 


9645 


0 


CYS B 104 


ATOM 


9646 


N 


Ab r O 1 0 D 


ATOM 


9648 


CA 


ASP B lOD 


ATOM 


9650 


CB 


ASP o 103 


ATOM 


9653 


CG 


ASP B 103 


ATOM 


9654 


ODl 


ASP B loo 


ATOM 


9655 


OD2 


ASP B 165 


ATOM 


9656 


C 


ASP B 165 


ATOM 


9657 


0 


ASP B 165 


ATOM 


9658 


N 


PRO B 166 


ATOM 


9659 


CA 


PRO B 166 


ATOM 


9661 


CB 


PRO B 166 


ATOM 


9664 


CG 


PRO B 166 


ATOM 


9667 


CD 


PRO B 166 


ATOM 


9670 


C 


PRO B 166 


ATOM 


9671 


0 


PRO B 166 


ATOM 


9672 


N 


THR B 167 


ATOM 


9674 


CA 


THR B 167 


ATOM 


9676 


CB 


THR B 167 


ATOM 


9678 


OGl THR B 167^ 


ATOM 


9680 


CG2 


THR B 167 


ATOM 


9684 


C 


THR B 167 


ATOM 


9685 


0 


THR B 167-* 


ATOM 


9686 


N 


PHE B 168 ;. 


ATOM 


9688 


CA 


PHE B 168 


ATOM 


9690 


CB 


PHE B 168- v 


ATOM 


9693 


CG 


PHE B 168' 


ATOM 


9694 


CD1 


PHE B 168 


ATOM 


9696 


CE1 PHE B 168" 


ATOM 


9698 


CZ 


PHE B 168 


ATOM 


9700 


CE2 


PHE B 168» 


ATOM 


9702 


CD2 


PHE B 168 


ATOM 


9704 


C 


PHE B 168 


ATOM 


9705 


0 


PHE B 168 


ATOM 


9706 


N 


ILE B 169 


ATOM 


9708 


CA 


ILE B 169 


ATOM 


9710 


CB 


ILE B 169 


ATOM 


9712 


CGI 


ILE B 169 


ATOM 


9715 


CD1 


ILE B 169 


ATOM 


9719 


CG2 


ILE B 169 


ATOM 


9723 


C 


ILE B 169 


ATOM 




0 


ILE B 169 


ATOM 


9725 


N 


LEO B 170 


ATOM 




CA 


LEU B 170 


ATOM 


9729 


CB 


LEO B 170 


ATOM 


y / 


CG 


LEO B 170 


ATOM 


QT3 A 

y / 


CD1 


LEU B 170 


ATOM 


J / JO 


CD2 


LEU B 170 


ATOM 


y / h c 


C 


LEU B 170 


ATOM 


y / h j 


0 


LEU B 170 


ATOM 


0 *7 A A 


N 


GLY B 171 


ATOM 


9746 


CA 


GLY B 171 




9749 


C 


GLY B 171 


ATOM 


9750 


0 


GLY B 171 


ATOM 


9751 


N 


CYS B 172 


ATOM 


9753 


CA 


CYS B 172 


ATOM 


9755 


CB 


CYS B 172 


ATOM 


9758 


SG 


CYS B 172 


ATOM 


9759 


C 


CYS B 172 


ATOM 


9760 


0 


CYS B 172 


ATOM 


9761 


N 


ALA B 173 


ATOM 


9763 


CA 


ALA B 173 


ATOM 


9765 


CB 


ALA B 173 


ATOM 


9769 


C 


ALA B 173 


ATOM 


9770 


0 


ALA B 173 


ATOM 


9771 


N 


PRO B 174 



362/514 



Figure 5 

59.467 30.079 22.456 1.00 54.27 

59.035 28.786 21.952 1.00 53.17 

58.980 28.941 20.464 1.00 54.83 

59.761 32.235 21.335 1.00 51.37 

60.692 32.131 20.548 1.00 53.81 

59.533 33.314 22.060 1.00 53.15 

60.316 34.513 21.872 1.00 50.95 

59.871 35.216 20.588 1.00 52.39 

58.919 36.701 20.857 1.00 44.47 

60.211 35.421 23.090 1.00 47.26 

59.233 35.366 23.818 1.00 43.94 

61.271 36.192 23.313 1.00 44.11 

61.343 37.196 24.349 1.00 42.83 

- 62.719 37.235 24.996 1.00 40.07 

62.869 38.3B3 25.988 1.00 44.60 

63.987 38.557 26.541 1.00 38.75 

61.921 39.159 26.298 1.00 46.40 

61.056 38.531 23.651 1.00 47.24 

• 61.902 39.072 22.885 1.00 45.76 

59.867 39.074 23.924 1.00 45.75 

59.384 40.258 23.211 1.00 40.43 

57.945 40.376 23.697 1.00 42.06 

57.956 39.791 25.037 1.00 42.71 

58.898 38.622 24.944 1.00 44.55 

60.161 41.523 23.533 1.00 32.47 

60.002 42.504 22.837 1.00 32.76 

61.006 41.512 24.546 1.00 30.17 

61.677 42.755 24.977 1.00 30.91 

62.843 42.448 25.902 1.00 27.98 

62.401 41.644 27.003 1.00 31.52 

.63.372 43.721 26.539 1.00 29.26 

.62.233 43.615 23.867 1.00 32.69 

-62.137 44.828 23.921 1.00 39.39 

62.851 42.985 22.880 1.00 36.43 

63.642- 43.706 21.913 1.00 35.88 

64.681 42.785 21.253 1.00 38.23 

65;364 43.420 20.074 1.00 34.51 

65.065' 43.020 18.780 1.00 37.06 

65.674 43.643 17.695 1.00 38.50 

• 66.590 44.666 17.920 1.00 37.97 

. 66.881- 45.068 19.209 1.00 32.99 

66.262 44.458 20.271 1.00 31.85 

62.758 44.323 20.840 1.00 34.83 

62.877 45.510 20.548 1.00 32.07 

61.930 43.476 20.230 ■ 1.00 32.19 

60.978 43.876 19.205 1.00 32.87 

60.151 42.657 18.794 1.00 33.57 

60.920 41.786 17.799 1.00 37.06 

60.219 40.434 17.439 1.00 36.02 

58.866 43.088 18.163 1.00 37.91 

60.034 44.979 19.716 1.00 32.65 

59.821 45.990 19.064 1.00 31.29 

59.459 44.754 20.885 1.00 33.03 

'58.642 45.749 21.553 1.00 32.36 

57.998 45.126 22.777 1.00 33.67 

56.534 44.747 22.712 1.00 36.61 

56.067 44.385 21.322 1.00 38.24 

56.306 43.599 23.663 1.00 38.59 

59.406 .46.991 21.985 1.00 31.49 

58.800 47.984 22.276 1.00 39.89 

60.717 46.944 22.092 1.00 32.67 

61.472 48.155 22.353 1.00 32.89 

61.707 48.921 21.061 1.00 34.33 

61/864 50.148 21.035 1.00 30.48 

61.766 48.177 19.971 1.00 35.93 

62.158 48.756 18.696 1.00 40.19 

62.360 47.654 17.635 1.00 43.64 

63.928*46.762 17.773 1.00 45.69 

61.084 49.735 18.254 1.00 38.35 

61.396 50.811 17.796 1.00 33.84 

59.820 49.352 18.456 1.00 34.64 

58.686 50.073 17.926 1.00 31.50 

57.421 49.263 18.133 1.00 33.46 

58.530 51.497 18.472 1.00 31.08 

58.388 52.435 17.695 1.00 27.21 

58.545 51.673 19.788 1.00 28.92 



SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



363/514 



ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATQM- 
ATOM 
ATOM 
ATOM ' 
ATOM • 
' ATOM- 
ATOM 
- ATOM v 
' ATOM< : 
ATOM V 
ATOM- 
ATOM- 
' ATOM" 
ATOM 1 

atom' ' 

ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 



9772 
9774 
9777 
9780 
9783 
9784 
9785 
9787 
9789 
9792 
9793 
9794 
9795 
9797 
9799 
9802 
9803 
9804 
9807 
9808 
9809 
9811 
9813 
9815 
9819 
9823 
9824 
9825 
9827 
9829 
9831 
9834 
9838 
9842 
9843 
9844 
•9846 
9848 
9851 
-9852 
9853 
9854 
9856 
9858 
9861 
9863 
9864 
9865 
9867 
9869 
9871 
9874 
9878 
9882 
9883 
9884 
9886 
9888 
9890* 
9893 
9897 
9901 
9902 
9903 
9905 
9907 
9910 
9911 
9913 
9915 
9917 
9919 
9921 
9922 
9923 
9925 



PRO B 174 
PRO B 174 
PRO B 174 
PRO B 174 
PRO B 174 
PRO B 174 
CYS B 175 
CYS B 175 
CYS B 175 
CYS B 175 
CYS B 175 
CYS B 175 
ASN B 176 
ASN B 176 
ASN B 176 
ASN B 176 
OD1 ASN B 176 
ND2 ASN B 176 
ASN B 176 
ASN B 176 
VAL B 177 
VAL B 177 
VAL B 177 
CGI VAL B 177 
CG2 VAL B 177 



C 
O 
N 

CA 

CB 

CGI 

CD1 

CG2 

C 

O 

N 

CA 

CB 

SG 

C 

O 

N 

CA 

CB 

OG 

C 

O 

N 

CA 
CB 
CGI 
CD1 



VAL B 177 
VAL B 177 
ILB B 178 
ILE B 178 
ILE B 178 
ILE B 178 
ILE B 178 
ILE B 178 
ILE B 178 
ILE B 178 
CYS B 179 
CYS B 179 
CYS B 179 
CYS B 179 
CYS B 179 
CYS B 179 
SER B 180 
SER B 180 
SER B 180 
SER B 180 
SER B 180 
SER B 180 
ILE B 181 
ILE B 181 
ILE B 181 
ILE B 181 
ILE B 181 



CG2 ILE B 181 



C 
O 
N 

CA 
CB 
CGI 
CD1 



ILE B 181 
ILE B 181 
ILE B 182 
ILE B 182 
ILE B 182 
ILE B 182 
ILE B 182 



CG2 ILE B 182 



C 
O 
N 

CA 

CB 

CG 

CD1 

CE1 

cz 

CE2 

CD2 

C 

0 

N 

CA 



ILE B 182 
ILE B 182 
PHE B 183 
PHE B 183 
PHE B 183 
PHE B 183 
PHE B 183 
PHE B 183 
PHE B 183 
PHE B 183 
PHE B 183 
PHE B 183 
PHE B 183 
HIS B 184 
HIS B 184 



58.459 

58.371 

58.131 

58.611 

59.678 

59.578 

60.840 

62.029 

63.251 

64.745 

61.949 

62.344 

61.459 

61.269 

60.894 

61.768 

62.683 

61.457 

60.191 

60.283 

59.175 

58.142 

56.965 

56.062 

56.128 

58.818 

58.583 

59.683 

60.422 

61.193 

60.278 

59.409 

62.246 

61.335 

61.436 

61.955 

62.827 

63.419 

64.699 

62.082 

62.599 

60.864 

60.041 

58.774 

5B.241 

59.729 

59.990 

59.239 

58.930 

58.380 

56.951 

56.555 

58.413 

60.125 

59.940 

61.336 

62.491 

63.571 

62.958 

63.119 

64.614 

63.148 

63.472 

63.314 

64.051 

64.936 

65.813 

66.123 

66.921 

67.442 

67.155 

66.339 

63.219 

63.783 

61.898 

60.893 



Figure 5 

53.011 20.354 



52.755 

51.327 

50.640 

53.875 

55.081 

53.269 

54.030 

53.143 

54.093 

54.611 

55.769 

53.779 

54.146 

52.893 

52.685 

53.463 

51.636 

55.227 

56.061 

55.220 

56.249 

56.018 

57.236 

54.862 

57.571 

58.522 

57.625 

58.858 

58.805 

59.113 

57.987 

59.899 

59.228 

60.388 

58.241 

58.488 

57.180 

56.413 

59.177 

60.115 

58.707 

59.258 

58.457 

58.698 

60 . 692 

61.588 

60.930 

62.287 

62.272 

61.724 

61.116 

63.665 

63.235 

64.413 

62.766 

63.686 

63.282 

62.906 

63.900 

64.386 

63.B22 

64.938 

62.695 

62.616 

61.351 

61.213 

59.956 

59.803 

60.920 

62.190 

62.331 

62.584 

62.623 

62.540 

62.295 



21.864 
22.043 
20.827 
20.006 
19.797 
19.950 
19.602 
19.677 
19.469 
18.191 
17.936 
17.284 
15.888 
15.077 
13.845 
13.570 
13.081 
15.709 
14.802 
16.572 
16.511 
17.489 
17.5.60 
17.046 
16.773 
16.041 
17.789 
18.080 
19.431 
20.603 
20.969 
19.462 
16.905 
16.557 
16.266 
15.097 
14.555 
15.579 
13.963 
13.341 
13.708 
12.656 
12.486 
11.204 
12.993 
12.190 
14.210 
14.663 
16.097 
16.123 
17.490 
16.712 
14.595 
14.287 
14.895. 
14.918 
15.949 
17.321 
18.383 
16.022 
13.542 
13.138 
12.839 
11.556 
11.561 
12.772 
13.262 
14.385 
15.024 
14.532 
13.411 
10.242 
9.162 
10.361 
9.284 



1.00 27.83 
1.00 28.29 
1.00 28.37 
1.00 28.79 
1.00 29.92 
1.00 30.02 
1.00 32.90 
1.00 38.05 
1.00 39.99 
1.00 43.27 
1.00 37.12 
1.00 36.00 
1.00 33.45 
1.00 34.90 
1.00 37.22 
1.00 37.79 
1.00 38.53 
1.00 32.25 
1.00 32.54 
1.00 30.28 
1.00 32.48 
1.00 29.47 
1.00 27.20 
1.00 32.51 
1.00 25.70 
1.00 29.44 
1.00 27.49 
1.00 33.14 
1.00 32.91 
1.00 33.73 
1.00 37.05 
1.00 48.50 
1.00 35.49 
1.00 29.84 
1.00 29.31 
1.00 34.56 
1.00 35.27 
1.00 36.73 
1.00 43.67 
1.00 31.48 
1.00 29.79 
1.00 30.22 
1.00 29.06 
1.00 30.56 
1.00 33.62 
1.00 34.26 
1.00 38.73 
1.00 35.30 
1.00 32.27 
1.00 33.41 
1.00 35.78 
1.00 34.35 
1.00 32.36 
1.00 34.08 
1.00 34.63 
1.00 35.33 
1.00 37.31 
1.00 38.28 
1.00 38.37 
1.00 38.80 
1.00 34.11 
1.00 3B.37 
1.00 37.64 
1.00 38.93 
1.00 42.19 
1.00 42.12 
1.00 38.57 
1.00 37.36 
1.00 40.00 
1.00 39.73 
1.00 40.54 
1.00 38.17 
1.00 43.73 
1.00 44.42 
1.00 49.53 
1.00 55.63 



SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



PCT/GB02/04872 



ATOM 


9927 


CB 


HIS 


B 


184 


ATOM 


9930 


CG 


HIS 


B 


184 


ATOM 


9931 


NDl 


HIS 


B 


184 


ATOM 


9933 


CEl 


HIS 


B 


184 


ATOM 


9935 


NE2 


HIS 


B 


184 


ATOM 


9937 


CD2 


HIS 


B 


184 


ATOM 


9939 


q 


HIS 


B 


184 


ATOM 


9940 


o 


HIS 


B 


184 


ATOM 


9941 


N 


LYS 


B 


185 


niun 


9943 




LYS 


B 


185 






CB 


LYS 


B 


185 


ttiura 


9948 


CG 


LYS 


B 


185 


ATOM 


9951 


CD 


LYS 


B 


185 


ATOM 


9954 


CE 


LYS 


0 


185 


ATOM 


9957 


NZ 


LYS 


Q 


185 


ATOM 


9961 




LYS 


B 


185 


ATOM 


9962 


o 


LYS 


B 


185 


ATOM 


9963 


N 


ARG 


B 


186 


ATOM 


9965 


CA 


ARG 


B 


186 


ATOM 


9967 


CB 


ARG 


B 


186 


ATOM 


9970 


CG 


ARG 


B 


186 


ATOM 


9973 


CD 


ARG 


B 


186 


ATOM 


9976 


NE 


ARG 


B 


186 


ATOM 


9978 


CZ 


ARG 


B 


186 


ATOM 


9979 


NH1 


ARG 


B 


186 


ATOM 


9982 


NH2 


ARG 


B 


186 


ATOM 


9985 


c 


ARG 


B 


186 


ATOM 


9986 


0 


ARG 


B 


186 


ATOM 


9987 


N 


PHE 


B 


187 


ATOM 


9989 


CA 


PHE 


B 


187 


ATOM 


9991 


CB 


PHE 


B 


187 


ATOM 


9994 


CG 


PHE 


B 


187 


ATOM 


9995 


CD1 


PHE 


B 


187 


ATOM 


9997 


CEl 


PHE 


B 


187 


ATOM 


9999 


CZ 


PHE 


B 


187 


ATOM 
nlUn 


JLUUvX 


CE2 


PHE 


£ 


187 




i nnrn 

X UUU O 


CD2 


PHE 


g 


187 


ft TOM 


i nfifts 

ivUUJ 


Q 


PHE 




187 


ATOM 


10006 


o 


PHE 


B 


187 


ATOM 


10007 


N 


ASP 


B 


188 


ATOM 


10009 


CA 


ASP 


B 


188 


ATOM 


10011 


CB 


ASP 


B 


188 


ATOM 


10014 


CG 


ASP 


B 


188 


ATOM 


10015 


OD1 


ASP 


B 


188 


ATOM 


10016 


OD2 


ASP 


B 


188 


ATOM 


10017 


Q 


ASP 


B 


188 


ATOM 


10018 


o 


ASP 


B 


188 


ATOM 


10019 


N 


TYR 


B 


189 


ATOM 


10021 


CA 


TYR 


B 


189 


ATOM 


10023 


CB 


TYR 


B 


189 


ATOM 


10026 


CG 


TYR 


B 


189 


ATOM 


10027 


CDl 


TYR 


B 


189 


ATOM 


10029 


CEl 


TYR 


B 


189 


ATOM 


10031 


CZ 


TYR 


B 


189 


ATOM 


10032 


OH 


TYR 


B 


189 


ATOM 


10034 


CE2 


TYR 


B 


189 


ATOM 


10036 


CD2 


TYR 


B 


189 


ATOM 


10038 


c 


TYR 


B 


189 


ATOM 


10039 


O 


TYR 


B 


189 


ATOM 


10040 


N 


LYS 


B 


190 


ATOM 


10042 


CA 


LYS 


B 


190 


ATOM 


10044 


CB 


LYS 


B 


190 


ATOM 


10047 


CG 


LYS 


B 


190 


ATOM 


10050 


CD 


LYS 


B 


190 


ATOM 


10053 


CE 


LYS 


B 


190 


ATOM 


10056 


NZ 


LYS 


B 


190 


ATOM 


10060 


C 


LYS 


B 


190 


ATOM 


10061 


O 


LYS 


B 


190 


ATOM 


10062 


N 


ASP 


B 


191 


ATOM 


10064 


CA 


ASP 


B 


191 


ATOM 


10066 


CB 


ASP 


B 


191 


ATOM 


10069 


CG 


ASP 


B 


191 


ATOM 


10070 


OD1 


ASP 


B 


191 


ATOM 


10071 


OD2 


ASP 


B 


191 


ATOM 


10072 


C 


ASP. 


B 


191 


ATOM 


10073 


0 


ASP 


B 


191 



364/514 



Figure 5 
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Figure 5 
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62.933 26.904 1.00 45.68 C 

60.805 28.074 1.00 43.82 C 

61.638 25.005 1.00 48.98 C 

62.400 24.278 1.00 52.52 0 

61.524 24.969 1.00 50.88 N 
62.283 24.026 1.00 47.23 C 
62.405 24.546 1.00 45.04 C 
61.601 22.660 1.00 42.41 C 
62.270 21.631 1.00 43.67 0 
60.274 22.661 1.00 37.92 N 
59.470 21.446 1.00 38.78 C 
57.977 21.784 1.00 36.59 C 

57.070 20.590 1.00 38.09 C 

56.757 20.038 1.00 40.54 C 
55.918 18.925 1.00 38.07 C 
55.384 18.352 1.00 39.38 C 
55.690 18.891 1.00 34.70 C 
56.531 20.006 1.00 37.52 C 

59.758 20.786 1.00 40.38 C 
60.034 19.602 1.00 42.64 0 

59.724 21.587 1.00 45.03 N 
60.038 21.128 1.00 44.91 C 
59.799 22.254 1.00 45.96 C 
58.327 22.362 1.00 44.94 C 
57.B67 23.613 1.00 51.63 S 
58.680 25.058 1.00 46.78 C 
61.468 20.607 1.00 44.94 C 
61.699 19.548 1.00 45.53 0 
62.417 21.331 1.00 40.87 N 
63.815 20.940 1.00 40.76 C 

64.725 21.977 1.00 37.19 C 
65.221 23.014 1.00 38. 72 C 
66.399 23.742 1.00 42.24 C 
66.397 25.183 1.00 46.22 C 
67.392 25.946 1.00 50.91 N 
64.094 19.594 1.00 44.44 C 
64.749 18.753 1.00 46.88 0 
63.621 19.400 1.00 49.58 N 
63.981 18.208 1.00 48.08 C 
63.804 18.409 1.00 46.17 C 

62.526 17.964 1.00 42.60 0 
63.161 17.029 1.00 45.38 C 
63.576 15.894 1.00 48.48 0 
62.014 17.295 1.00 45.27 N 
61.291 16.248 1.00 45.03 C 
59.941 16.756 1.00 40.61 C 
59.300 15.845 1.00 44.20 C 
58.851 14.584 1.00 43.44 C 
58.263 13.733 1.00 47.55 C 
58.115 14.146 1.00 52.95 C 

57.525 13.315 1.00 57.39 0 
58.555 15.405 1.00 50.58 C 
59.159 16.238 1.00 49.74 C 

62.071 15.739 1.00 46.40 C 
62.038 14.567 1.00 49.36 0 
62.755 16.643 1.00 47.66 N 
63.586 16.294 1.00 45.40 C 
63.964 17.560 1.00 45.87 C 
62.808 17.942 1.00 45.67 C 
62.782 19.371 1.00 46.75 C 

-65.239 17.345 1.00 46.14 C 

64.813 15.562 1.00 46.76 C 

65.254 14.621 1.00 46.95 0 

65.356 16.003 1.00 51.05 N 

66.529 15.393 1.00 50.93 C 

66.851 16.182 1.00 52.92 C 

68.202 16.147 1.00 57.19 C 

68.042 15.520 1.00 60.36 C 

69.304 15.461 1.00 55.37 C 

66.221 13.932 1.00 51.61 C 

67.056 13.056 1.00 54.59 0 

64.992 13.677 1.00 51.38 N 

64.527 12.331 1.00 56.33 C 
63.054 12.399 1.00 61.49 C 
62.459 11.223 1.00 66.11 C 
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Figure 5 

64.562 72.863 7.435 

64.800 74.604 7.910 
66.637 74.877 7.724 
64.515 72.747 4.235 
65.250 73.698 3.964 
63.301 72.589 3.723 
62.570 73.671 3.078 
61.432 73.092 2.228 
60.928 74.051 1.171 
60.573 75.194 1.515 
60.831 73.732 -0.035 
62.001 74.529 4.202 
61.292 74.017 5.069 

62.321 75.821 4.191 
61.813 76.767 5.190 
62.452 78.150 4.988 

63.801 78.325 5.676 
63.690 78.296 7.500 
65.506 78.193 7.924 
60.287 76.933 5.209 
59.752 77.429 6.184 
59.606 76.538 4.134 
58.143 76.619 4.019 
57.763 77.151 2.634 
58.645 78.294 2.187 
58.674 79.336 2.822 
59.375 78.100 1.094 
57.415 75.289 4.218 
56.186 75.264 .4.301 
58.172 74.191 4.243. 
57.629 72.837 4.389*. 
57.713 72.081 3.060 
57.028 72.815 1.933. 
55.799 72.857 1.873 
57.818 r 73.429 1.042 
58.361 72.038 5.466. 
59.122 71.119 5.152 
58.131 72.382 . 6.733 
58.682 71.611 7.847 

58.322 72.458 v 9.069 
57.136 73.229 8.664 
57.305 73.508 7.205 
57.983 70.277 7.939 
56.752 70.264 7.829 
58.737 69.194 8.147 
58.153 67.845 8.209 
58.567 67.059 6.972 
58.358 67.807 5.646 
58.504. 66.899 4.428 
57.504 66.484 3.834 
59.749 66.5B0 4.064 
58.488 67.031 9.477 
58.048 65.887 9.612 
59.264 67.596 10.400 
59.664 66.851 11.604 
60.687 65.719 11.282 
61.972 66.234 10.669 
62.663 65.474 9.948 
62.368 67.400 10.856 
60.181 67.779 12.682 
60.236 68.981 12.484 
60.544 67.218 13.825 
60.844 68.006 15.018 
61.104 67.077 16.187 
61.174 67.757 17.494 
60.031 67.940 18.255 
60.089 68.564 19.497 
61.295 69.004 19.980 
62.456 68.808 19.235 
62.389 68.186 17.999 
62.041 68.906 14.790 
62.069 70.060 15.222 
63.026 68.369 14.084 
64.202 69.125 13.740 
65.255 68.210 13.091 
65.754 67.154 14.092 



1.00 84.03 C 

1.00 88.56 S 

1.00 89.69 C 

1.00 78.15 C 

l.OO 74.47 0 

l.OO 77.54 N 

1.00 76.31 C 

1.00 78.24 C 

1.00 79.63 C 

1.00 77.36 O 

1.00 86.06 0 . 

1.00 74.07 C 

1.00 72.65 O 

1.00 72.70 N 

1.00 73.66 C 

1.00 78.63 C 

1.00 84.37 C 

1.00 92.67 S 

1.00 91.18 C 

1.00 68.70 C 

1.00 66.93 0 

1.00 67.17 N 

1.00 65.69 C 

1.00 66.01 C 

1.00 65.31 C 

1.00 65.48 0 

1.00 64.76 N 

1.00 65.51 C 

1.00 64.96 0 

1.00.66.61 N 

1.00.67.86 C 

1.00 69.16 C 

1.00 69.32 C 

1.00 68.53 0 

1.00 6B.49 N 

1.00 67. 3B C 

1.00 71.60 O 

1.00 64.03 N 

1.00 60.29 C 

1.00.61.68 C 

1.00.63.40 C 

1.00 63.19 C 

1.00 56.55 C 

1.00 55.21 0 

1.00 54.18 N 

1.00 50.65 C 

l'.OO 51.99 C 

1.00 56.17 C 

1.00 60.06 C 

1.00 61.16 0 

1.00 60.08 N 

1.00 47.52 C 

1.00 47.37 0 

1.00 43.22 N 

1.00 42.21 C 

1.00 41.05 C 

1.00 43.12 C 

1.00 35.33 0 

1.00 45.36 0 

1.00 36.98 C 

1.00 39.94 0 

1.00 34.72 N 

1.00 35.92 C 

1.00 36.71 C 

1.00 39.61 ■ C 

1.00 43.44 C 

1.00 41.38 C 

1.00 44.10 C 

1.00 44.59 C 

1.00 44.63 C 

1.00 39.07 C 

1.00 37.31 0 

1.00 41.38 N 

1.00 41.92 C 

1.00 45.68 C 

1.00 46.61 C 



SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



ATOM 


11279 


CDl 


ILE o 


ATOM 


11283 


CG2 


Tin t> £A 

ILE B £09 


ATOM 


11287 


C 


ILE B 264 


ATOM 


11288 


0 


ILE B 2o4 


ATOM 


11289 


N 


ASP B 265 


ATOM 


11291 


CA 


ASP B 265 


ATOM 


11293 


CB 


ASP B 265 


ATOM 


11296 


CG 


ASP B 265 


ATOM 


11297 


OD1 


ASP B 265 


ATOM 


11298 


OD2 


ASP B 265 


ATOM 


11299 


C 


ASP B 265 


ATOM 


11300 


0 


ASP B 265 


ATOM 


11301 


N 


CYS B 266 


ATOM 


11303 


CA 


CYS B 266 


ATOM 


11305 


CB 


CYS B 266 


ATOM 


11308 


SG 


CYS B 266 


ATOM 


11309 


C 


CYS B 266 


ATOM 


11310 


0 


CYS B 266 


ATOM 


11311 


N 


PHE B 267 


ATOM 


11313 


CA 


PHE B 267 


ATOM 


11315 


CB 


PHE B 267 


ATOM 


11318 


CG 


PHE B 26/ 


ATOM 


11319 


CDl 


PHE B 267 


ATOM 


11321 


CE1 


PHE B 267 


ATOM 


11323 


CZ 


PHE B 267 


ATOM 


11325 


CE2 


PHE B 26/ 


ATOM 


11327 


CD2 


PHE B 20/ 


ATOM 


11329 


C 


PHE B & o / 


ATOM 


11330 


0 


PHE B Zb f 


ATOM 


11331 


N 


LEU B 268 


ATOM 


11333 


CA 


LEU B 2 Do 


ATOM 


11335 


CB 


LEU B 2 bo 


ATOM 


11338 


CG 


LEU B £Do 


ATOM 


11340 


CDl 


LEU B 268 


ATOM 


11344 


CD2 


LEU B 268 


ATOM 


11348 


C 


LEU B 268 


ATOM 


11349 


0 


LEU B 268 


ATOM 


11350 


N 


MET B 269 


ATOM 


11352 


CA 


MET B 269 


ATOM 


11354 


CB 


MET B 269 


ATOM 


11357 


CG 


MET B 269 


ATOM 


11360 


SD 


MET B 269 


ATOM 


11361 


cs 


utim o OCQ 
VlCtl O £u3 


ATOM 


11365 


C 


MET B 269 


ATOM 


11366 


0 


MET B 269 


ATOM 


11367 


N 


LYS B 270 


ATOM 


11369 


CA 


LYS B 270 


ATOM 


11371 


CB 


LYS B 270 


ATOM 


11374 


CG 


LYS B 270 


ATOM 


11377 


CD 


LYS B 270 


ATOM- 


11380 


CE 


LYS B 270 


ATOM 


11383 


HZ 


LYS B 270 


ATOM 


11387 


C 


LYS B 270 


ATOM 


11388 


0 


LYS B 270 


ATOM 


11389 


N 


MET B 271 


ATOM 


11391 


CA 


MET B 271 



ATOM 11393 CB MET B 271 

ATOM 11396 CG MET B 271 

ATOM 11399 SD MET B 271 

ATOM 11400 CE MET B 271 

ATOM 11404 C MET B 271 

ATOM 11405 0 MET B 271 

ATOM 11406 N GLU B 272 

ATOM 11408 CA GLU B 272 

ATOM 11410 CB GLU B 272 

ATOM 11413 CG GLU B 272 

ATOM 11416 CD GLU B 272 

ATOM 11417 OE1 GLU B 272 

ATOM 11418 OE2 GLU B 272 

ATOM 11419 C GLU B 272 

ATOM 11420 O GLU B 272 

ATOM 11421 N LYS B 273 

ATOM 11423 CA LYS B 273 

ATOM 11425 CB LYS B 273 

ATOM 11428 CG LYS B 273 

ATOM 11431 CD LYS B 273 



373/514 



Figure 5 
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16.276 


ATOM 


122B8 


o 


ILE 


B 


327 


' 37.618 


44.162 


15.284 


ATOM 


12289 


N 


GLU 


B 


328 


37.297 


42.215 


16.335 


ATOM 


12291 


CA 


GLU 


B 


328 


36.585 


41.605 


15.193 


ATOM 


12293 


CB 


GLD 


B 


328 


36.124 


40.177 


15.508 


ATOM 


12296 


CG 


GLD 


B 


328 


37.166 


39.098 


15.244 


ATOM 


12299 


CD 


GLU 


B 


328 


37.258 


38.695 


13.777 


ATOM 


12300 


OEl 


GLD 


B 


328 


37.581 


37.510 


13.517 


ATOM 


12301 


OE2 


GLU 


B 


328 


37.020 


39.551 


12.878 


ATOM 


12302 


c 


GLD 


B 


328 


35.366 


42.411 


14.730 


ATOM 


12303 
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GLD 


B 


328 


35.072 


42.453 


13.540 
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- 12304 


N 
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B 
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34 .678 


43.056 


15.675 


ATOM 
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CA 
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B 


.329 


33.396 


43.706 


15.427 
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12308 


CB 


ARG 


B 
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32.563 


43.649 
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CG 
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B 
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31.133 


44.137 


16.560 


ATOM 


12314 


CD 
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B 
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30.235 


43.641 


17.673 


ATOM 


12317 


NE 


ARG 


B 


329 


30.943 


43.662 


18.948 
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ARG 


B 
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31.198 


44.760 


19.666 
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B 
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44.656 


20.B09 


ATOM 
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19.256 
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329 


33.532 


45.161 


14.974 


ATOM 
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B 


329 


32.888 


45.562 


14.015 


ATOM 


12328 
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VAL 
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330 


34.357 


45.944 


15.660 


ATOM 


12330 


• CA 


VAL 


B 


330 


34.509 


47.374 


15.361 


ATOM 


12332 


CB 


VAL 


B 
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35.063 


48.148 


16.584 


ATOM 


12334 


CGI 


VAL 


B 
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35.317 


49.609 


16.225 


ATOM 
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17.771 
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B 
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12343 
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VAL 


B 
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13.396 
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N 


ILE 


B 
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36.517 


46.795 


14.165 


ATOM 
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CA 


ILE 


B 


331 


37.516 


46.769 


13.097 



1.00 52.57 C 

1.00 54.91 C 

1.00 60.48 N 

1.00 44.92 C 

1.00 42.91 0 

1.00 41.09 N 

1.00 37.70 C 

1.00 32.10 C 

1.00 31.21 C 

1.00 31.20 C 

1.00 40.36 C 

1.00 42.66 0 

1.00 40.69 N 

1.00 42.94 C 

1.00 41.53 C 

1.00 40.85 C 

1.00 44.29 C 

1.00 44.23 0 

1.00 43.10 N 

1.00 45.79 C 

1.00 45.30 O 

1.00 47.59 N 

1.00 51.73 C 

1.00 57.74 C 

1.00 64.44 C 

1.00 71.38 C 

1.00 70.36 O 

1.00 74.79 O 

1.00 51.42 C 

1.00 50.44 O :,; 

l.oo 49.00 n t 

1.00 49.23 C 

1.00 50.78 C 

1.00 53.03 C 

1.00 54.81 C ^ 

1.00 55.68 O >„..;. 

1.00 57.25 O • -A: 

l.oo 49.61 c • 

1.00 53.26 O ■ i^s* 

1.00 49.05 N ■', ■ iri'ir. 

1.00 49.13 C 

1.00 48.28 C 

1.00 44.91 C 

1.00 41.80 C 

1.00 45.94 C 

1.00 52.37 C 

1.00 53.70 O 

1.00 54.56 N 

1.00 56.63 ' C 

1.00 60.16 C 

1.00 67.68 C 

1.00 75.82 C 

1.00 80.86 O 

1.00 80.83 O 

1.00 56.31 C 

1.00 56.03 . O 

1.00 55.54 N 

1.00 51.95 C 

1.00 53.35 C 

1.00 53.54 C 

1.00 53.07 C 

1.00 55.73 N 

1.00 55.55 C 

1.00 53.37 N 

1.00 53.68 N 

1.00 50.23 C 

1.00 52.14 O 

1.00 45.19 N 

1.00 44.83 C 

1.00 42.37 C 

1.00 39.33 C 

1.00 40.05 C 

1.00 47.84 C 

1.00 45.31 O 

1.00 51.01 N 

1.00 52.82 C 



SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



ATOM 12348 CB ILE B 331 

ATOM 12350 CGI ILE B 331 

ATOM 12353 CD1 ILE B 331 

ATOM 12357 CG2 ILE B 331 

ATOM 12361 C ILE B 331 

ATOM 12362 0 ILE B 331 

ATOM 12363 N GLY B 332 

ATOM 12365 CA GLY B 332 

ATOM 12368 C GLY B 332 

ATOM 12369 0 GLY B 332 

ATOM 12370 N ARG B 333 

ATOM 12372 CA ARG B 333 

ATOM 12374 CB ARG B 333 

ATOM 12377 CG ARG B 333 

ATOM 12380 CD ARG B 333 

ATOM 12383 NE ARG B 333 

ATOM 12385 CZ ARG.B 333 

ATOM 12386 NH1 ARG B 333 

ATOM 12389 NH2 ARG B 333 

ATOM 12392 C ARG B 333 

ATOM 12393 O ARG B 333 

ATOM 12394 N ASM B 334 

ATOM 12396 CA ASN B 334 

ATOM 12398 CB ASN B 334 

ATOM 12401 CG ASN B 334 

ATOM 12402 OD1 ASN B 334 
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CA 
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CB 


CYS B 338 
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SG 
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12465 


C 


CYS B 338 


ATOM 


12466 


0 


CYS B 338 
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12467 


N 


MET B 339 
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12469 


CA 


MET B 339 


ATOM 


12471 


CB 


MET B 339 


ATOM 


12474 


CG 


MET B 339 


ATOM 


12477 


SD 


MET B 339 


ATOM 


12478 


CE 


MET B 339 


ATOM 


12462 


C 


MET B 339 


ATOM 


12483 


0 


MET B 339 


ATOM 
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N 


GLN B 340 
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12486 


CA 


GLN B 340 


ATOM 


12488 


CB 
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ATOM 


12491 


CG 
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12494 


CD 
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ATOM 
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NE2 
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C 
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ATOM 


12500 


0 


GLN B 340 



PCT/GB02/04872 

380/514 



Figure 5 

38.943 46.904 13.724 1.00 50.59 C 

39.015 48.142 14.636 1.00 48.52 C 

40.127 48.117 15.684 1.00 49.03 C 

39.999 47.008 12.646 1.00 50.48 C 

37.365 45.445 12.335 1.00 56.51 C 

36.966 44.416 12.902 1.00 59.42 0 

37.639 45.434 11.048 1.00 60.33 N 

37.641 44.152 10.362 1.00 63.25 C 

38.836 43.301 10.774 1.00 63.78 C 

39.479 43.550 11.789 1.00 57.35 0 

39.122 42.278 9.979 1.00 68.79 N 

40.450 41.679 9.973 1.00 72.05 C 

40.386 40.178 9.658 1.00 75.29 C 

40.587 39.275 10.878 1.00 80.45 C 

41.294 37.953 10.579 1.00 84.54 C ' 

40.838 36.877 11.464 1.00 90.01 N 

39.666 36.235 11.364 1.00 94.47 C 

38.771 36.541 10.418 1.00 96.99 N 

39.381 35.274 12.236 1.00 95.05 N 

41.287 42.416 8.930 1.00 72.07 C 

42.514 42.401 8.998 1.00 72.21 0 

40.612 43.083 7.990 1.00 71.97 N 

41.276 43.711 6.847 1.00 72.04 C 

40.281 43.978 5.701 1.00 73.89 C 

39.644 42.699 5.168 1.00 76.22 C 

38.437 42.652 4.911 1.00 80.28 0 

40.448 41.650 5.017 1.00 75.92 N 

42.011 44.994 7.222 1.00 67.53 C 

43.239 45.000 7.253 1.00 68.52 0 

41.279 46.070 7.515 1.00 60.88 N 

41.921. 47.364 7.779 1.00 55.97 C 

40.933 . 48.523 7.668 1.00 54.56 C 

39.811 48.529 8.684 1.00 51.91 C 

38.986 49.796 8.621 1.00 54.15 C 

39.018 50.508 9.892 1.00 55.24 N 

38.210 50.270 10.902 1.00 52.39 C 

37.264 49.345 10.812 1.00 60.64 N 

38.344 50.957 12.007 1.00 47.19 N 

, 42.638 47.446 9.129 1.00 54.90 C 

42.315 46.741 10.104 1.00 52.17 0 

43.617 48.334 9.175 1.00 50.64 N 

44.401 48.503 10.378 1.00 49.64 C 

45.808 49.010 10.015 1.00 51.18 C 

45.860 50.413 10.001 1.00 52.19 O 

43.649 49.419 11.374 1.00 45.36 C 

42.806 50.242 10.969 1.00 44.10 0 

43.884 49.236 12.672 1.00 39.02 N 

43.179 50.032 13.687 1.00 36.79 C 

43.636 49.419 15.004 1.00 39.79 C 

44.065 48.018 14.604 1.00 41.37 C 

44.760 48.232 13.286 1.00 38.31 C 

43.581 51.467 13.611 1.00 32.90 C 

44.591 51.757' 12.978 1.00 31.41 . 0 

42.765 52.333 14.198 1.00 31.34 N 

43.050 53.767 14.302 1.00 33.19 C 

42.604 54.519 13.061 1.00 33.40 C 

40.819 54.752 12.978 1.00 41.64 S 

42.376 54.321 15.540 1.00 31.25 C 

41.558 53.650 16.156 1.00 33.29 0 

42.748 55.531 15.923 1.00 33.28 N 

42.302 56.097 17.192 1.00 33.94 C 

42.988 57.445 17.419 1.00 34.41 C 

44.480 57.338 17.716 1.00 35.79 C 

44.792 56.018 18.892 1.00 36.24 S 

44.226 56.678 20.362 1.00 39.22 C 

40.780 56.254 17.297 1.00 36.78 C 

40.210 56.151 18.381 1.00 36.23 0 

40.120 56.498 16.170 1.00 41.14 N 

38.672 56.689 16.166 1.00 41.45 C 

38.145 57.037 14.765 1.00 44.15 C 

37.870 58.535 14.569 1.00 47.15 C 

37.539 58.884 13.123 1.00 49.99 C 

36.647 58.275 12.525 1.00 51.07 0 

38.253 59.867 12.562 1.00 46.81 N 

37.947 55.467 16.698 1.00 41.44 C 

36.864 55.612 17.256 1.00 44.50 0 
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38.762 
37.776 
39.784 
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38.649 
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41.400 
41.615 
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34.451 
33.577 
33.254 
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.32.536 
33.166 
34.143 
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35.953 
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45.583 44.172 34.638 1.00 33.05 

45.417 44.644 33.368 1.00 29.24 

44.689 43.913 32.456 1.00 33.40 

44.143 42.720 32.811 1.00 33.15 

44.322 42.239 34.100 1.00 34.76 

42.911 42.954 37.205 1.00 42.85 

42.474 41.796 37.073 1.00 39.75 

42.137 44.038 37.158 1.00 42.94 

40.726 43.966 36.802 1.00 44.24 

39.864 43.916 38.058 1.00 48.45 

38.393 43.694 37.751 1.00 51.42 

37.585 43.673 38.705 1.00 61.18 

37.953 43.503 36.600 1.00 55.69 

40.306 45.159 35.947 1.00 41.54 

40.192 46.287 36.458 1.00 31.52 

40.052 44.919 34.657 1.00 39.01 

39.650 46.007 33.759 1.00 40.67 

39.506 45.322 32.387 1.00 41.75 

40.161 43.971 32.516 1.00 40.78 

40.113 43.621 33.961 1.00 41.74 

38.347 46.719 34.189 1.00 41.38 

38.154 47.876 33.794 1.00 35.27 

37.497 46.066 34.993 1.00 44.48 

36.230 46.677 35.441 . 1.00 45.62 

35.285 45.625 36.063 1.00 48.18 

34.637 44.740 35.043 1.00 51.72 

33.476 45.094 34.382 1.00 51.63 

33.176 44.155 33.501 1.00 56.83 

34.104 43.209 33.558 1.00 55.65 

35.032 43.555 34.511 1.00 53.25 

36.436 47.871 36.362 1.00 44.02 

35.506 48.614 36.631 1.00 45.97 

37.666 48.047 36.837 1.00 47.01 

38.092 49.246 37.554 1.00 45.91 

39.588 49.132 37.932 i.00 46.82 

39.904 48.333 39.168 1.00 47.89 

40.402 48.962 40.283 1.00 57.14 

40.702 48.052 41.205 1.00 58.22 

40.456 46.843 40.710 1.00 52.28 
39.974 46.989 39.431 1.00 50.06 

37.871 50.489 36.642 1.00 43.05 

37.872 51.625 37.105 1.00 41.89 
37.705 50.261 35.339 1.00 44.22 
37.428 51.326 34.361 1.00 43.55 
38.663 51.599 33.464 1.00 41.08 
39.847 52.157 34.212 1.00 35.75 
40.781 51.307 34.782 1.00 30.99 
41.B57 51.804 35.484 1.00 33.70 
42.030 53.171 35.618 1.00 34.69 
41.102 54.036 35.049 1.00 37.32 
40.017 53.527 34.348 1.00 32.97 
36.212 50.987 33.491 1.00 44.26 
36.196 51.294 32.300 1.00 44.30 
35.194 ' 50.384 34.105 1.00 47.09 
33.927 50.058 33.444 1.00 48.12 
33.972 48.636 32.898 1.00 45.27 
34.900 48.303 31.743 1.00 40.01 
34.821 46.814 31.517 1.00 42.30 
34.539 49.027 30.468 1.00 37.59 

32.703 50.188 34.384 1.00 52.07 
32.705 49.684 35.526 1.00 51.79 
31.649 50.819 33.861 1.00 54.94 
30.425 51.102 34.613 1.00 56.14 
29.784 52.429 34.136 1.00 55.19 
29.136 52.329 32.746 1.00 55.31 
28.803 53.385 32.153 1.00 58.16 
28.913 51.252 32.158 1.00 59.46 

29.457 49.932 34.505 1.00 59.66 
29.833 48.876 34.006 1.00 58.36 
28.244 50.137 35.025 1.00 67.04 
27.063 49.266 34.853 1.00 71.76 



SUBSTITUTE SHEET (RULE 26) 



WO 03/035693 



PCT/GB02/04872 



389/514 



Figure 5 ■ 
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Figure 5 
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Figure 5 
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Figure 5 

48.583 41.684 1.00 87.54 N 

49.091 42.095 1.00 89.57 C 

48.610 41.137 1.00 91.42' C 

47.102 41.210 1.00 94.19 ' C 

46.326 40.703 1.00 94.88 O 

46.680 41.859 1.00 95.18 N 

50.615 42.214 1.00 88.47 C 

51.333 41.254 1.00 83.30 O 

51.081 43.402 1.00 89.74 N 

52.490 43.758 1.00 90.50 C 

53.283 43.047 1.00 91.74 C 

53.205 43.413 1.00 90.64 O 

54.018 42.010 1.00 91.94 N 

54.982 41.285 1.00 89.81 C 

56.196 40.822 1.00 91.80 C 

57.090 41.950 1.00 94.77 C 

58.147 41.630 1.00 95.71 C 

58.989 42.622 1.00 94.74 C 

58.784 43„95B 1.00 94.68 C 

57.740 44.297 1.00 92.90 C 

56.900 43.299 1.00 94.20 C 

54.385 40.040 1.00 84.51 C 

55.072 39.386 1.00 84.09 O 

53.137 39.694 1.00 77.54 N 

52.459 38.539 1.00 72.44 C 

52.436 37.341 1.00 70.79 C 

51.037 38.907 1.00 68.26 C 

50.593 40.054 1.00 67.52 O 

50,353 37.939 1.00 60.70 N 

48.978 38.126 1.00 57.04 C 

48.894 39.04B 1.00 58.21 C 

49.705 38.605 1.00 60,97 O 

48.352 36.790 1.00 52.88 C 
49.062 35.834 1.00 49.59 ■ '* ' O' 
47.020 36.737 1.00 46.89 o :< V. N 

46.285 35.471 1.00 44.49 .#*• C 

46.170 34.954 1.00 40.35 . I C 
47.544 34.681 1.00 39.58 . - - ^ C 

45.424 35.935 1.00 39.08:. CI' 
44.876 35.657 1.00 44.38' ■ I C 
44.366 36.766 1.00 49.04 ' . O • 

44.239 34.597 1.00 41.57 N 

42.912 34.751 1.00 42.61 C 

42.744 33.463 1.00 39.62 C 

43.543 32.453 1.00 39.43 C 

44.695 33.197 1.00 38.63 C 

41.841 34.831 1.00 45.77 C 

42.120 34.488 1.00 50.02 O 

40.632 35.245 1.00 46.34 N 

39.542 3^.225 1.00 47.12 C 

38.453 36.099 1.00 45.16 C 

38,684 35.941 1.00 46.23 C 

40.188 35.746 1.00 47.53 C ■ 

39.041 33.803 1.00 45.57 C 

39.293 32.923 1.00 45.15 O 

38.332 33.602 1.00 42.27 N 

37.693 32.341 1.00 39.00 C 

36.924 32.427 1,00 39.89 C 

36.160 31.177 1.00 40.74 C 
34.797 31.101 1.00 37.74 C 
34.111 29.942 1.00 41.00 C 
34.782 28.825 1.00 40.62 C 
36.132 28.884 1.00 40.63 C 
36,822 30.054 1.00 40.80 C 
36.739 31.962 1.00 39.47 C 
36.062 32.817 1.00 44.71 O 
36.695 30.681 1.00 37.43 N 
35.698 30.195 1.00 38.26 C 
36.052 30.576 1.00 37.96 C 

37.161 29.723 1.00 36.10 C 
36.878 28.601 1.00 33.64 C 
37.901 27.809 1.00 32.12 C 
39.212 28.124 1.00 32.43 C 
40.242 27.327 1.00 35.93 O 
39.502 29.228 1.00 28.64 C 
38.494 30.012 1.00 26.13 C 
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Figure 5 

58.921 35.589 28.691 1.00 38.77 
59.486 36.473 28.044 1.00 43.13 
58.368 34.503 28.162 1.00 39.23 
58.339 34.232 26.733 1.00 42.80 

59.247 33.060 26.360 1.00 45.49 

60.728 33.221 26.678 1.00 51.47 

61.505 . 31.901 26.506 1.00 54.57 

61.058 30.842 26.988 1.00 54.51 

62.651 31.967 25.828 1.00 47.10 

56.938 33.850 26.311 1.00 41.81 

56.116 33.456 27.138 1.00 43.85 

56.692 33.945 25.007 1.00 44.20 

55.407 33.586 24.409 1.00 43.57 

54.394 34.691 24.675 1.00 42.60 

54.405 35.931 23.771 1.00 40.44 

53.079 36.694 23.878 1.00 41.53 

55.585 36.039 24.127 1.00 40.96 

55.515 33.378 22.903 1.00 46.37 

56.564 33.639 22.293 1.00 50.60 

54.402 32.965 22.305 1.00 46.20 

54.303 32.803 20.861 1,00 46.00 

54.036 31.354 20.524 1.00 48.22 

55.149 30.221 21.337 1.00 55.68 

53.177 33.636 20.269 1.00 44.86 

52.058 33.669 20.777 1.00 41.38 

53.493 34.281 .19.161 1.00 43.43 

52.539 35.018 18.393 1.00 40.56 

53.233 36.267 17.873 1.00 39.13 

53.699 37.186 18.979 1.00 39.59 

55.004 37.141 19.443 1.00 42.12 

55.430 37.994 20.460 1.00 41.04 

54.543 38.894 21.031 1.00 38.31 

53.239 38.938 20.583 1.00 39.02 

52.823 38.092 19.563 1.00 37.87 

52.006 34.100 17.295 1.00 40.89 

52.546 34.033 16.201 1.00 44.07 

50.968 33.345 17.629 1.00 44.99 

50.336 32.413 16.690 1.00 48.17 

49.576 31.286 17.454 1.00 49.83 

50.560 30.267 18.009 1.00 52.13 
51.187 30.719 19.256 1.00 56.47 

48.561 30.561 16.563 1.00 50.33 
49.362 33.195 15.825 1.00 49.33 
48.398 33.759 16.353 1.00 49.41 
49.588 33.239 14.511 1.00 48.42 
48.637 33.879 13.591 1.00 47.81 
49.207 33.526 12.223 1.00 45.70 
50.650 33.421 12.468 1.00 46.66 
50.765 32.726 13.792 1.00 47.29 
47.190 33.383 13.723 1.00 49.60 
46.932 32.269 14.193 1.00 53.68 
46.253 34.220 13.311 1.00 50.54 
44.839 33.910 13.460 1.00 53.88 
44.021 35.189 13.822 1.00 56.65 
44.602 35.889 15.071 1.00 56.58 
43.943 36.172 12.639 1.00 55.77 
44.294 33.243 12.183 1.00 56.50 
44.745 33.574 11.080 1.00 55.59 
43.413 32.361 12.217 1.00 58.51 
56.685 60.810 29.745 1.00 32.88 
57.469 62.064 29.648 1.00 29.60 
57.844 62.983 30.635 1.00 19.44 
5B.574 64.187 30.139 1.00 19.96 
59.127 65.395 30.839 1.00 29.43 
58.129 66.515 31.050 1.00 40.72 
58.553 .67.622-31.716 1.00 45.47 
58.025 68.709 31.535 1.00 42.97 
59.495 67.550 32.494 1.00 42.16 
57.978 62.643 28.502 1,00 19.94 
58.700 63.932 28.711 1.00 21.83 
59.349 64.823 27.697 1.00 22.05 
57.943 62.140 27.138 1.00 10.30 
57.338 60.904 26.740 1.00 15.54 
57.266 60.421 25.348 1.00 15.15 
57.822 60.986 24.097 1.00 19.75 
57.685 62.447 23.830 1.00 21.61 
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Figure 5 

56.549 59.148 25.455 

'56.229 58.175 24.374 

56.317 59.059 26.935 

56.781 60.073 27.663 

55.594 57.974 27.581 

55.303 57.824 29.010 

54.559 56.637 29.485 

54.083 55.524 28.638 

53.726 54.224 29.268 

54.367 56.811 30.922 

53.684 55.894 31.906 

55.053 58.124 31.105 

55.591 58.704 29.980 

55.120 58.709 32.455 

55.750 59.978 32.838 

56.374 60.791 31.957 

56.836 61.829 32.676 

57.536 62.926 32.056 

55.758 60.477 34.234 

55.231 59.858 35.465 

56.512 61.736 34.106 

56.796 62.672 35.206 

56.28.6 64.081 34.968 

56.282 64.848 36.282 

57.200 65.605 36.553 

55.376 64.722 37.095 

22.332 66.101 17.605 

39.184 67.864 0.001 

22.056 87.604 21.635 

27.500 67.000 -0.128 

. 1.104 69.686 20.708 

8.566 57.157 21.230 

40.398 83.015 15.513 
37.456 61.356 17.014 
24.050 64.615 37.173 
12.129 87.344 -2.599 
37.168 87.666 18.527 
14.590 74.553 -1.456 
19.193 83.690 16.494 
27.712 76.184 2.633 
43.102 78.613 11,891 

3.853 72.508 29.609 

-5.059 76.270 32.784 

-5.363 57.919 31.417 

10.301 91.578' 1.728 

28.270 65.046 40.231 

7.163 85.912 -3.043 

14.287 61.919 1.304 

9.027 50.947 31.814 

12.374 75.274 19.679 

5.893 79.717 49.307 

18.461 71.665 38.701 

24.404 67.509 39.056 

44.677 79.411 24.432 

33.788 63.777 15.135 

-3.551 63.803 23.749 

33.399 80.082 29.766 
15.610 83.594 21.904 
27.914 57.685 35.536 
46.028 66.475 18.676 
40.340 81.794 8.593 
31.072 61.378 36.156 
27.037 71.946 38.741 
20.274 60.935 25.175 
19.089 94.703 16.262 

4.851 79.290 18.114 

13.188 63.252 4.505 

35.356 61.598 15.387 

37.145 84.259 13.711 

24.256 75.099 36.311 

15.474 61.460 18.010 

13.422 73.606 30.124 
21.317 73.327 25.139 

21.423 73.465 22.667 
12.463 72.061 21.045 
41.339 74.411 22.961 



1.00 13.87 C 

1.00 20.80 C 

1.00 20.60 C 

1.00 16.80 N 

1.00 21.69 C 

1.00 16.63 C 

1.00 17.32 C 

1.00 18.16 C 

1.00 20.16 C 

1.00 18.50 C 

1.00 20.53 C 

1.00 18.61 C 

1.00 14.65 N 

1.00 17.86 C 

1.00 21.95 C 

1.00 18.63 N 

1.00 15.05 C 

1.00 16.56 C 

1.00 19.04 C 

1.00 21.81 C 

1.00 20.49 C 

1.00 27.56 C 

1.00 32.06 C 

1.00 35.26 C 

1.00 38.75 O 

1.00 35.97 O 

1.00 37.45 O 

0.50 43.93 O 

1.00 34.04 O 

1.00 40.58 O 

1.00 35.67 O 

1.00 35.27 O 

1.00 39.71 O 

1.00 44.00 O 

1.00 34.60 O 

1.00 45.59 O 

1.00 39.28 O 

1.00 39.13 O 

1.00 43.76 O 

1.00 34.87 O 

1.00 40.26 O 

1.00 42.76 O 

1.00 48.36 O 

1.00 55.31 O 

1.00 44.91 O 

1.00 50.04 O 

1.00 45.72 O 

1.00 56.90 O 

1.00 47.11 O 

1.00 26.99 O 

1.00 47.04 O 

1.00 50.71 O 

1.00 30.26 O 

1.00 61.22 O 

1.00 39.88 O 

1.00 43.84 O 

1.00 39.25 O 

1.00 40.99 O 

1.00 50.29 O 

1.00 35.21 O 

1.00 39.78 O 

1.00 41.52 O 

1.00 32.63 O 
1.00 47.32 " O 

1.00 51.56 O 

1.00 33.85 O 

1.00 33.45 O 

1.00 28.04 O 

1.00 43.51 O 

1.00 53.60 O 

1.00 42.53 O 

1.00 25.06 O 

1.00 24.80 O 

1.00 27.99 O . 

1.00 46.29 O 

1.00 43.80 O 
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Figure 7 
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Table 19 

9.736 63.854 49.703 1.00 

9.926 64.926 50.648 1.00 

11.179 ' 64.699 51.522 1.00 
11.213 65.582 52.635 1.00 
10.081 66.214 49.889 1.00 
10.135 66.209 48.670 1.00 

-1.691 79.606 49.752 1.00 

-1.990 79.922 48.375 1.00 

-2.508 81.324 48.227 1.00 

-2.688 81.625 46.729 1.00 

-3.800 81.503 49.042 1.00 

-4.233 82.961 49.184 1.00 

-0.817 79.793 47.444 1.00 

-0.880 79.195 46.369 1.00 

11.288 78.658 45.498 1.00 

11.681 78.446 44.125 1.00 

10.620 77.648 43.352 1.00 

11.031 77.295 41.916 1.00 

9.841 76.728 41.126 1.00 

12.254 76.363 41.913 1.00 

11.911 79.726 43.386 1.00 

12.797 79.762 42.534 1.00 

11.087 80.767 43.655 1.00 

11.180 82.079 43.050 1.00 
12.622 82.693 43.140" 1.00 
12.728 84.199 42.944 1.00 
12.192 84.972 43.782 1.00 
13.356 84.584 41.921 1/00 
10.614 82.028 41.640 1.00 
10.039 81.024 41.214 1:00 

10.531 83.213 40.999 .liOO 
10.137 83.514 39.647 1.00 

9.474 84.857 39.534 1:00 

8.170 84.823. 40.349 1.00 

10.463 85.943 39.991 1.00 

11.277 83.498. 38.659 1.00 

11.041 83.343 37.459 1.00 

14.532 79.499- 38.222 1.00 
13.844 78.448 38.924 1.00 
12.578 78.003 38.170 1.00 
12.004 76.675 38.660 1.00 
10.541 76.087 37.755 1.00 
10.690 74.386 38.373 1.00 
14.747 77.256 38.962 1.00 
14.767 76.516 39.943 1.00 

17.746 72.929 46.226 1.00 

16.752 72.143 46.887 1.00 
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7.676 71.191 44.865 1.00 

8.641 71.771 43.853 1.00 

8.623 73.312 43.861 1.00 

10.027 71.184 44.151 1.00 

6.282 71.614 44.497 1.00 

5.610 70.934 43.737 1.00 

12.355 53.260 33.477 1.00 

10.972 53.538 33.715 1.00 

12.160 53.614 36.917 1.00 
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ATOM 3606 OG SER A 482 22.058 

ATOM 3607 C SER A 482 24.179 

ATOM 3608 0 SER A 482 24.985 

ATOM 3630 N ILE A 485 28.457 

ATOM 3631 CA ILE A 485 29.664 

ATOM 3632 CB ILE A 485 29.609 

ATOM 3633 CG2 ILE A 485 28.435 

ATOM 3634 CGI ILE A 485 30.939 

ATOM 3635 CD1 ILE A 485 30.834 

ATOM 3636 C ILE A 485 29.736 

ATOM 3637 O ILE A 485 28.732 



Figure 9 



92.615 21.465 1.00 0.00 A O 

91.128 22.603 1.00 0.00 A C 

91.470 23.467 1.00 0.00 A O 

91.464 15.646 1.00 0.00 A N 

90.889 15.105 1,00 0.00 A C 

89.380 15.236 1.00 0.00 A C 

88.876 14.380 1.00 0.00 A C 

88.670 14.941 1.00 0.00 A C 

87.151 15.088 1.00 0.00 A C 

91.231 13.634 1.00 0.00 A C 

91.585 13.019 1.00 0.00 A O 
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10 20 30 40 50 

2C9 MDSLWLVLC LSCLLLLSLW RQSSGRGKLP PGPTPLPVIG NILQIGIKDI 

2C9trunc MA KKTSSKGR -P PGPTPLPVIG NILQIGIKDI 

2C9-FGloop MA. KKTSSKGR -P PGPTPLPVIG NILQIGIKDI 

2C9-P220 MA KKTSSKGR -P PGPTPLPVIG NILQIGIKDI 

60 70 80 90 100 

2C9 SKSLTNLSKV YGPVFTLYFG LKPIWLHGY EAVKEALIDL GEEFSGRGIF 

2C9trunc SKSLTNLSKV YGPVFTLYFG LKPIWLHGY EAVKEALIDL GEEFSGRGIF 

2C9-FGloop SKSLTNLSKV YGPVFTLYFG LKPIWLHGY EAVKEALIDL GEEFSGRGIF 

2C9-P220 SKSLTNLSKV YGPVFTLYFG LKPIWLHGY EAVKEALIDL GEEFSGRGIF 

110 120 130 140 150 

2C9 PLAERANRGF GIVFSNGKKW KEIRRFSLMT LRNFGMGKRS IEDRVQEEAR 

2C9trunc PLAERANRGF GIVFSNGKKW KEIRRFSLMT LRNFGMGKRS IEDRVQEEAR 

2C9~FGloop PLAERANRGF GIVFSNGKKW KEIRRFSLMT LRNFGMGKRS IEDRVQEEAR 

2C9-P220 PLAERANRGF GIVFSNGKKW KEIRRFSLMT LRNFGMGKRS IEDRVQEEAR 

160 170 180 190 200 

2C9 CLVEELRKTK ASPCDPTFIL GCAPCNVICS IIFHKRFDYK DQQFLNLMEK 

2C9trunc CLVEELRKTK ASPCDPTFIL GCAPCNVICS IIFHKRFDYK DQQFLNLMEK 

2C9-FGloop CLVEELRKTK ASPCDPTFIL GCAPCNVICS IIFHKRFDYK DQQFLNLMEK 

2C9-P220 CLVEELRKTK ASPCDPTFIL GCAPCNVICS IIFHKRFDYK DQQFLNLMEK 

210 .. . 220 230 240 250 

2C9 LNENIKILSS PWIQICNNFS PIIDYFPGTH NKLLKNVAFM KSYILEKVKE 

2C9trunc " LNENIKILSS PWIQICNNFS PIIDYFPGTH NKLLKNVAFM KSYILEKVKE 

2C9-FGloop LNENIKILSS PWIQVYNN FP ALL DYFPGTH NKLLKNVAFM KSYILEKVKE 

2C9-P220 LNENIKILSS PWIQICNNFP TIIDYFPGTH NKLLKNVAFM KSYILEKVKE 

260 270 280 290 300 

2C9 HQESMDMNNP QDFIDCFLMK MEKEKHNQPS EFTIESLENT AVDLFGAGTE 

2C9trunc HQESMDMNNP QDFIDCFLMK MEKEKHNQPS EFTIESLENT AVDLFGAGTE 

2C9-FGloop HQESMDMNNP QDFIDCFLMK MEKEKHNQPS EFTIESLENT AVDLFGAGTE 

2C9-P220 HQESMDMNNP QDFIDCFLMK MEKEKHNQPS EFTIESLENT AVDLFGAGTE 

.310 320 330 340 350 

2C9 TTSTTLRYAL LLLLKHPEVT AKVQEEIERV IGRNRSPCMQ DRSHMPYTDA 

2C9trunc TTSTTLRYAL LLLLKHPEVT AKVQEEIERV IGRNRSPCMQ DRSHMPYTDA 

2C9-FGloop TTSTTLRYAL LLLLKHPEVT AKVQEEIERV IGRNRSPCMQ DRSHMPYTDA 

2C9-P220 TTSTTLRYAL LLLLKHPEVT AKVQEEIERV IGRNRSPCMQ DRSHMPYTDA 

360 370 380 390 400 

2C9 WHEVQRYID LLPTSLPHAV TCDIKFRNYL IPKGTTILIS LTSVLHDNKE 

2C9trunc WHEVQRYID LLPTSLPHAV TCDIKFRNYL IPKGTTILIS LTSVLHDNKE 

2C9-FGloop WHEVQRYID LLPTSLPHAV TCDIKFRNYL .IPKGTTILIS LTSVLHDNKE . 

2C9-P220 WHEVQRYID LLPTSLPHAV TCDIKFRNYL IPKGTTILIS LTSVLHDNKE 

410 420 430 440 450 

2C9 FPNPEMFDPH HFLDEGGNFK KSKYFMPFSA GKRICVGEAL AGMELFLFLT 

2C9trunc FPNPEMFDPH HFLDEGGNFK KSKYFMPFSA GKRICVGEAL AGMELFLFLT 

2C9-FGloop FPNPEMFDPH HFLDEGGNFK KSKYFMPFSA GKRICVGEAL AGMELFLFLT 

2C9-P220 FPNPEMFDPH HFLDEGGNFK KSKYFMPFSA GKRICVGEAL AGMELFLFLT ' 

460 .470 480 " 490 500 

2C9 SILQNFNLKS LVDPKNLDTT PWNGFASVP PFYQLCFIPV 

2C9trunc SILQNFNLKS LVDPKNLDTT PWNGFASVP PFYQLCFIPV HHHH 

2C9-FGloop SILQNFNLKS LVDPKNLDTT PWNGFASVP PFYQLCFIPV HHHH 

2C9-P220 SILQNFNLKS LVDPKNLDTT PWNGFASVP PFYQLCFIPV HHHH 
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Sequence Listing: 



2C9trunc 
SEQ ID NO:l 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGATCTGCAATAATTTTTC 
TCCTATCATTGATTACTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

SEQ ID NO: 2 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLN LME KLNEN I K I LS S PW I QI CNN FS P 1 1 D Y FPGT HNKLLKN VAFMK S Y I LEKVKE HQE S M DMNN PQ D F I DC FLMKME KEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPVVNGFASVPPFYQLCFI PVHHHH 



1072 (2C9-P220) \ * -\ v > 

SEQ ID NO:3 .< 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGATCTGCAATAATTTTCC 
GACCATCATTGATTACTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC. 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

SEQ ID NO: 4 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRG FG I V FS NGKKW KE I RR FS LMT LRN FGMG KRS I E DR VQE E ARCLV E E LRKTKAS PC D PT F I LGC A PCN V ICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQICNNFPTIIDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAVVHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
T SI LQN FN LKS LVD PKNLDTTPWNG FAS VPPFYQLCFI PVHHHH 

2C9-FGloop (1015) 
SEQ ID NO: 5 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
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TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTG AT CCCACT TT C AT CCTGGGCTGTGCTC CCTGC AATGTG AT CT GCTCC ATT ATTTT C C AT AAACGT TTTG ATT AT AAAG AT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

SEQ ID NO: 6 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 

ANRGFGIVFSNGKKWKEIRRFSLMTLRNEX3MGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFOYKD 

QQFLNIJ4EKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMmJPQDFIDCFLMKM 

QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 

TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 

TSILQNFNLKSLVDPKNLDTTPVVNGFASVPPFYQLCFIPVHHHH 

1155 <2C9-FGloop K206E) 
SEQ ID NO: 7 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCGAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
MGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGC AAG AC AGGAGC C ACATGC CCT AC AC AG ATGCTGTGGTGCACGAGGT C CAGAGAT AC AT TG ACCT T CT C CCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

SEQ ID NO: 8 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRG FG I V FS NGKKWKE I RR FS LMT LRN FGMG KRS I E DRVQE E1ARC LVEE LRKT KAS PC D PT F I LGCAPCN VI CSIIFHKRFDYKD 
QQFLNLMEKLNENIEILSSPWIQVYTINFPALLDYFPGTHNKLLKNVAFMKSYILEBCVKEHQESMD^4NNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
T S I LQN FN LKS LVD PKN LDTT P VVNG FAS V P P FYQLC FI PVHHHH 



1078 SEQ ID NO: 9 & 10 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GG AC AT C AGC AAATCCT T AACCAAT CT CTC AAAGGTCT ATGGCCCTGTGT TC ACTCTGT AT TTTGGCCTG AAACCC AT AGTGGT G C 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAATTCTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAGCGCTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGATCTGCAATAATTTTCC 
GCCTATCATTGATTACTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
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AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTGTGCTTCATTCCTGTCCATCATCATCATTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQICNNFPPIIDYFPGTHNKLLKNVAFMKSYILEKVKEHQ^^ 

QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
T S I LQN FN LK S LV D PKNL DTT P WNG FAS V P P FYQLC F I P VHHHH 



1081 SEQ ID NO: 11 & 12 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTiSTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTAGATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGMCTTTAACCTGAAATCTCTGGTTGACCCAAAGGACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCT.GCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKD^ 

AN RG FG I V F SNGKKWKE I RR FS LMT LRN FGMGKRS I E DR VQEEARC LVEE LRKTKAS P C D PT FI LG C A PCN V I C S 1 1 FHKR F D YK D 
QQFLNLMEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAVVHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKDLDTTPWNGFASVPPFYQLCFIPVHHHH 



1082 SEQ ID NO: 13 & 14 

ATGGCTAAGAAAACGAGCTCTAAAGGGCQGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTCCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAVVHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPSYQLCFIPVHHHH 
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1085 SEQ ID NO: 15 & 16 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
GAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRG FG I V FSNG KKWKE I RR FS LMT LRN FGMGKRS I E DR VQE E ARC L VEE LRKT KAS PC D PT F I LGC APCN V I CS 1 1 FHKR FD YK D 
EQFLNLMEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFI DCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 



1097 SEQ ID NO: 17 & 18 ' 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAiGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
C^GGAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATC^GATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QEFLNLMEKLNENIKILSSPWI Q VYNN F P ALLOY FPGT HN KLLKN VA FMKS Y I LEKVKEHQE S M DMNN PQ D FI DC FLMKME KE KHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 



1100 SEQ ID NO: 19 & 20 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
G CT AAC AG AGG ATTTGGAATTGTTTT C AGC AAT GG AAAG AAATGG AAGG AG ATC CGG CGTTTCT CCCT CAT G ACGCT G CGG AATT T 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCMAAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
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GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLN LME KLNEN IKILSSPWI QV YNN F PAL L D Y F PGTHNK LLKN V A FMK S Y I LEKVKEHQKSM DMN N PQ D F I DC FLMKME KE KHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPVVNGFASVPPFYQLCFIPVHHHH 



1101 SEQ ID NO:21 & 22 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTC'TACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGCAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA ? ' : 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVK^LIDLGEEFSGRGIFPLAER 
ANRG FG I V FS NG KKWKE I RR FS LMT LRN FGMGKRS I E DRVQE E ARCL VEE LRKT KAS PC D PT FI LG C A PCN V ICSIIFHKRFDYKD 
QQFLNIMEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLL^ 

QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 



1102 SEQ ID NO: 23 & 24 

AT GGCTAAGAAAACG AGC T CT AAAGGGCGGCCGCCTGGCCCC ACT CCTCTCC C AGTG ATTGG AAAT AT CCT AC AG AT AGGT ATT AA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAG AAC ACC AAG AATC AATGG AC ATG AAC AACCCT CAGGACTTT ATTG AT TGCTT CCTG AT G AAAATGGAG AAGG AAG AC G AC AAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLN LMEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFI DC FLMKMEKEDDN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
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TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPVVNGFASVPPFYQLCFIPVHHHH 



1115 SEQ ID NO:25 & 26 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGCAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP • .-.^ .,::>'.<.. 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDAGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL &y v,;,**: ;: <l.r - • :• 
TSIXQNFNLKSLVOPKNLDTTPWNGFASVPPFYQLCFIPVHHHH *.,0«nr v 

■■:•;.■;:-..>■■■■ -.v^v*-:. ■ ■- • • '' ■ 

111.6 s SEQ. ID*NO:27 & 28 : j/?. : V i: fOV. ' 

■ ;•> ■ ? " . -if ,* •■'■ ■" ■* V. - ; 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA rfvGC'rP • A ? *o: * 
GGACATCAGCAAAT CCTT AACCAATCTCTCAAAGGTCT ATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCAT AGTGGTGC 7 ;-; AC T. ; / . V 

TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA v^jft^r A'v - o:^.?; 
GCTAAGAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT -it ;>,-Wv.\v vv.;v w 
TGGG ATGGGG AAG AGG AGC ATTGAGG ACCGTGT T C AAG AGG AAGCCCGCTGCCTTGTGGAGG AGTTGAGAAAAAC C AAGG CCT C AC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGGAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGgTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGC7TCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GG AGCCC CTGC ATG CAAGAC AGG AGCC ACATGC C CTACAC AGATGCTGT GGT GC ACGAGGTCC AG AG AT ACATTGACCTT CTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAGCGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNI^EKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKV^ 

QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
T S I LQN FN LKS LV D PAN L DTT P WNG FAS V P P F YQLC F I P VHHHH 



1117 SEQ ID NO:29 & 30 

ATGGCTAAGAAAACGAGCTCT7WVGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTGC 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC. 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
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CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTC.TGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIADISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGIFPLAER 

ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 

QQFLNLMEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKWAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKH 

QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 

TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 

TSILQNFNLKSLVDPKNLOTTPWNGFASVPPFYQLCFIPVHHHH 



1118 SEQ ID NO: 31 & 32 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 

GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 

TG C AT GG AT ATGAAGCAGTGAAGGAAGCCCTG ATT GATCTTGGAGAGGAGTTTTC TGGAAGAGG CAT TTTCCCACTGGCT G AAAG A 

GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 

TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCGCGGCCTCAC 

CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 

CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 

TGCTCTCCTTGATTATTTCCCGGGAAGTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 

AAGAACACCAAGAATCAATGGACATGAACAACCCT.CAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 

CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 

GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC- 

GGAGCCCCTGCATGCAAGACAGGAGCCACAT.GCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 

ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 

TTCTGTGCTACATGACAACAAAGAATT.TCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 

AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 

ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 

TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGIFPLAER 
AN RG FG I V FSNGKKWKE I RR FS LMT LRN FGMGKR S I E DR VQEEARC L VE E LRKT AAS PCD PT F I LGC APCN V I CS 1 1 FHKRFD Y K D 
QQFLNLMEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFI DCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
T S I LQN FN LKS LV D PKNLDTT P WNG FAS V P P FYQLC FI P VHHHH 



1121 SEQ ID NO: 33 & 34 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGGCGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCT^AAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 
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MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIW 

ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEAEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 



1122 SEQ ID NO: 35 & 36 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGCAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAG AACACC AAGAATCAAT GGAC ATG AACAACC CTCAGGACTTT ATTG ATTGCT TCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA . 

MMKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYAAVECEALIDLGEEFSGRGIFPLAER 

ANRGFGIVFSNGKKWKEIRRFSI^TLRNEXSMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVI 

QQFLNLMEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAF^^ 

QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKY.FMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 

1123 SEQ ID NO: 37 & 38 

. *i '-."•*'. . ' * ■*■' . * ' 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAGCGGCATGGGCGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGAAWAEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNUIEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVA^ 

QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAVVHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 



1165 SEQ ID NO:39 & 40 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
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TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACTCT 
GAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCTGAGATATGC 
TCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACCGGAGCCCCT 
GCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCCACCAGCCTG 
CCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGACTTCTGTGCT 
ACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGAAAAGTAAAT 
ACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTGACCTCCATT 
TTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTCTGTGCCGCC 
CTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDt4NNPQDFIDCFLMKMEKEKHS 
EFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLPTSL 
PHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFLTSI 
LQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 



1220 SEQ ID NO: 41 & 42 

ATGGCTAAGAAAACG AGCT CTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAAT ATCCTACAGATAGGTATT AA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
< GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT, 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGATCTGCAATAATTTTCC 
TTCTATCATTGATTACTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQICNNFPSIIDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMBCMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTP\A^NGFASVPPFYQLCFIPVHHHH 



1319 SEQ ID NO:43 & 44 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCTCGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
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AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTG CCG CCCTTCT ACCAG CT CTG CTTC ATT CCTGTCC ACC AC CACC ACTG A 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGSKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKECWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
T S L PHAVTC D I K FRN Y L IPKGTTILISLTSVLHDNKEFPNPEMFDPHHFL DEGGN FKKSK Y FM P FS AGKR I CVGEALAGME L FL FL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 



1339 SEQ ID NO:45 & 46 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GG AC ATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTC ACT CTGTATTTTGGCCTGAAACCC AT AGTGGTGC 
TGCATGG AT ATG AAGG AGTG AAGGAAG CCCTG ATTG AT CTTGGAG AGGAGTTTT CT GGAAGAGGC ATTTT CCCACT GGCTG AAAG A 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTTTATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTXCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCG CCCTTCT AC CAGCTGTGCTTCATTCCTGTCCATCATC AT CATTGA 

! 7\. " *\f 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNI^EKLNENIKILSSPWIQV.YNNFPALLDYFPGTHNKLLK^^ 

QPSEFTIESLENTAVDLFGAGTE.TTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 



1340 SEQ ID NO:47 & 48 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGCAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTTTATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTGTGCTTCATTCCTGTCCATCATCATCATTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYAAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKAS PCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSFILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPVVNGFASVPPFYQLCFIPVHHHH 
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1361 SEQ ID NO:49 & 50 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGATTTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATG AAC AACCCTCAGG ACTT T ATT GAT TGCTTC CTGATG AAAATGG AG AAGG AAAAG C AC AAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
T GTGCCGCCCTTCT AC CAG CTGTGCTT C ATTCCTGTCC AC CACC ACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYHX3LKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQIYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPVVNGFASVPPFYQLCFIPVHHHH 

1362 SEQ ID NO: 51 & 52 *;\": •,'..-.''.! 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTT^CTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTiGCAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCC TGC ATGCAAGAC AGGAG CCACATGCC CT AC AC AGATG CTGTGGTGC ACG AGGTCCAG AGAT AC ATTGACCTTCT CC C C 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTGTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQVCNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAVVHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPVVNGFASVPPFYQLCFIPVHHHH 



1363 SEQ ID NO: 53 & 54 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTATCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
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GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAMGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGJCRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQVYNNFPAILDYFPGTHNKLLKNVAFMKSYI^^ 

QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 



1364 SEQ ID NO:55 & 56 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGC AATTT CTT AACTT AATGGAAAAGTT G AATG AAAAC ATCAAG ATTTT G AGCAGCCCCTGGATC CAGGTCT ACAATAATTT CCC 
TGCTCTCATTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAG AACACC AAGAATC AATGGAC ATG AACAAC CCTC AGGACTTT ATTGATTG CTTCCTG AT GAAAAT GGAG AAGG AAAAG CAC AAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 

.. GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGAGCTTCTCCCC 
..^^AGCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAAT.T.TCCCTGAC* 

,\ TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA' 
^AAAGTAMTACTTC^TGCCTTTCTC^ 

^^.ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
iTGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA "V - '* 

Vwi^KTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSG 

rANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNI^EKLNENIKILSSPWIQVYNNFPALIDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 



1366 SEQ ID NO: 57 & 59 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCT AAC AGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGG AAGG AG ATCCGGCGTTTCTCCCTC ATG ACGCTGCGG AATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGATTTGCAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAG AAC ACC AAG AAT CAAT GG ACATGAACAACCCTC AGG ACTT T ATTG ATTGCT T CCTG ATG AAAATGG AGAAGG AAAAGC ACAAC 
CAACCATCTG AATTT ACT ATTGAAAGCTTGG AAAAC ACT GCAGTTGACTTGTTTGGAGCTGGG AC AG AGACGACAAGC AC AACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTTAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQICNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAVVHEVQRYIDLLP 



WO 03/035693 
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TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPVVNGFASVPPFYQLCFIPVHHHH 



1367 SEQ ID NO: 59 & 60 



ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGATTTGCAATAATTTCCC 
TGCTCTCATTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNIUJIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQICNNFPALIDYFPGTHNKLLKNVAFMKSYILEKV^ 

QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH .-x- < r.~ v - 

1368 SEQ ID NO: 61 & 62 t \K'"- 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGC ATGG AT ATG AAGC AGTG AAGGAAGC C CTGATT G AT CT TGG AGAGG AGTTT T CTGG AAG AGGC AT TT TCCC ACTGGCTG AAAG A 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTC^TCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGATTTGCAATAATTTCCC 
TGCTATCATTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAG AACACC AAG AAT C AATGGAC ATGAAC AACC CTC AGG ACTTT ATTG AT TGCTT CCTG ATG AAAATGG AGAAGG AAAAG C ACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 



MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGEXSIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQICNNFPAIIDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKH PEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAVVHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFI PVHHHH 



1369 SEQ ID NO: 63 & 64 



ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 



WO 03/035693 PCT/GB02/04872 
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CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTATCATTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAG7^ACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQVYNNFPAIIDYFPGTHNKLLKNVAFMKSYILEKVKEHQEStlDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TS I LQNFNLKS LVDPKNLDTT PWNGFAS VPP FYQLCFI PVHHHH 

1370 SEQ ID NO: 65 & 66 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC ' ' 

: CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT , " v 

iSAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAA^ 

TGCTATCATTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA ■'.T&t'ZWln 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC WSACSfc* 
' "CAACCAT CT G AATT T ACTATTG AAAGCTT GGAAAACACTGC AGTTG AC T TGTTT GG AGCTGGG AC AG AG ACGACAAGC AC AACGC T ' $ i- c^^pA.-.' V 
.;GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC: . 3 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC^ ^C^Cf^C 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC: X$tti#PV 
TTCTGTGCTACATGACAACAAA 

;^GTA^TACTTCATGCCTTTCT(^CAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
sACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC i« T 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
AN RG FG I V FS NGKKWKE I RR FS LMT LRN FGMGKRS I E DRVQEEARC L VE ELRKT KAS PC D PT FI LGC APCNV ICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQVCNNFPAIIDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKE 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTT P WNG F AS V P P F YQLC FI PVHHHH 

1371 SEQ ID NO: 67 & 68 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGATTTATAATAATTTCCC 
TGCTCTCATTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAG AACACC AAGAAT C AATGG AC ATG AAC AACCCT C AGG ACTTT ATTG ATT GCTT CCTGATG AAAATGG AGAAGG AAAAGC AC AAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTGTGCTTCATTCCTGTCCACCACCACCACTGA 



WO 03/035693 
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MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYBAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
C^FLNLMEKLNENIKILSSPWIQIYNNFPALIDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAVVHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TS I LQN FNLKS LVDPKNLDTT PWNGFASVPPFYQLCFI PVHHHH 



1372 SEQ ID NO: 69 & 70 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGATTTCTAATAATTTCCC 
TGCTATCATTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
G AGAT ATGCTCTCCTT CTCCTGCTG AAGC ACC CAGAGGTCAC AGCT AAAGTCC AGG AAGAG ATTG AACGTGTG AT TGGC AG AAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
AC CAGCCTGCCCCATG C AGTGACCTGTG AC AT TAAATTCAGAAACT AT CTCATT CCC AAGGGC AC AACCAT ATT AATTTC C CTG AC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSIjTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLffTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFII/jCAPCNVICSIIFHKRFDYKD 
QQFLNU4EKLNENIKILSSPWIQISNNFPAIIDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKE 
QPSEFTIES LENT A VD L FGAGTETT STT LR YALL LLLKH PE VT AKVQE E IERVIGRNRS PCMQ DR S HM P YT D A WHE VQR Y I D LL P 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
T S I LQN FN LKS LVD PKNLDTT P WNG FAS V P P FYQLC FI PVHHHH 

1 • - ■ 

1391 SEQ ID NO:71 & 72 v.T.- « 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTMCTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGtCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGCACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
G AG AT AT GCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAG AG ATTG AACGTGTG ATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDI-SKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMHNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
T S I LQN FN LKS LVDPKN L DTT P WNG FAS VP p FYQLC FI PVHHHH 



1392 SEQ ID NO:73 & 74 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
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TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCATGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAG AT ATGCTCT CCTT CTC CTGCTGAAGCAC CCAGAGGTCACAGCT AAAGTCC AGGAAGAGATTGAACGTGTGAT TGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSI^TLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEBCVKEHHESMDMNNPQDFI DCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TS I LQN FNLKSLVDPKNLDTT PWNGFAS VPPFYQLC FI PVHHHH 



1394 SEQ ID NO:75 & 76 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCtGGAAGAGGGATTTTCCCACTGGCTGAAAGA 
GCT AAC AGAGG ATT TGGAATTGTTTT CAG CAATGG AAAGAAATGG AAGGAGAT CCGGCGTTTCT GCCTCATGACGCT GCGGAATTT 
TGGGAT GGGGAAGAGGAGC AT T GAGG ACCGTGTTC AAGAGGAAGCCCGCTGC CTTGTGG AGGAG TTGAG AAAAACC AAGGCCTC AC 
CCTGTGATCCCACTTTGATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCAT^ATTTTGCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGGAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
•TGCTCTCCTTGATTATTTCCCGGGMCTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAG 
AAGAAC AC CAAG AAT CAATGG ACATG AAC AACCCTCAGGACTTT ATTGAT TGCTTC CT G ATG AAAATGG AG AAGG AAAAGCACAAC 
CAACC ATCTGAATTT ACT AT TG AAAGCTTGGAAAACACTGCAGTTGACTT GTTT GG AGCTGGGAC AGAG ACGAC AAG CAC AACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGA<3ATTGAACGTGTGATTGGCAGAAACC 
GG AGCCCCTGC ATGCAAG AC AGG AGCC ACATGC C CT ACAC AGATGCTGTGGTGCACGAGGT GC AGAG AT ACATTGACCTTCT CCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCACCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDIX3EEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TS I LQN FNLKSLVDPKNLDTTPVVNGFASVPPFYHLCFI PVHHHH 



1396 SEQ ID NO:77 £ 78 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTGAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GG AG CCCCT GC ATG AGCG AC AGG AG CC ACAT G CC CT AC AC AG ATGC TGTGGTGC ACGAGGT CC AG AG AT AC ATTG ACCTTCT CCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 



WO 03/035693 PCT/GB02/04872 

17/24 

AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGXKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNI^EKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMSDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFI PVHHHH 



1397 SEQ ID NO:79 & 80 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
G CT AACAG AGGATTTGG AATTGTTTT CAGC AATGG AAAGAAAT GGAAGGAGATCCGGCGTTT CT CC CTC ATGACGCT GCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGC AATTTCTTAACTT AATGG AAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGG AT CCAGGTCT AC AATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
7\AGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
. - t . . ■-- ,TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGTTGGTGGCAATTTTAAGA 

^-'i'tv a^^^aagtaaatacttcatgcctttctcagcaggaaaacggatttgtgtgggaatcgccctggccggcatggagctgtttttattcctg: 
' 7 -V acctccattttacagaactttaacctgaaatctctggttgacccaaagaaccttgacaccactccagttgtcaatggatttgcctc- 
.^^gtgccgcccttctacc^gctctgcttcattcctgtccaccaccaccactga . w v 

Clc V .,&MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTO 

" . ^ . 

■ : * . \ Y> ^QQFLNLMEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDC 
»' , / - ;1:;QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAVVHEVQRYIDLLP'. 

; ? ►« - ; ; -. .- . "TS I LQN FNLKSLVD PKN LDTTPWNG FAS VPPFYQLC FI PVHHHH 



1424 SEQ ID N0:81 & 82 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTTTATGGCCCTGTGTTC^ 

TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAATTCTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAGCGCTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGATCTGCAATAATTTTTC 
TGCTCCTATTGATTACTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTGTGCTTCATTCCTGTCCATCATCATCATTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSPCVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQICNNFSAPIDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFI DCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPVVNGFASVPPFYQLCFI PVHHHH 
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1443 SEQ ID NO:83 & 84 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAG AAC AC C AAG AAT C AATGG ACATGAACAACCCT CAGGACTTT AT TGATTGCTTCCTG ATGAAAATGG AGAAGG AAAAGCAC AAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAG ATATG CTCTCCTT CTCCTGCT G AAGC ACCCAG AGGT CACAGCT AAAGT C CAGGAAG AG ATTGAAAATGTGATTGGC AGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCACCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKH^ 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLIJCHPEVTAKVQEEIENVJGRNRSPCMQDRSHMPYTDAVVHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
T S I LQN FN LKS LV D PKN LDTT PWNG FAS V PP FYHLC FI P VHHHH 



1444 SEQ ID NO:85M"8:6*V7;-,r»: ■- 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCGCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGG ATATG AAGC AGTGAAGGAAGCGCTGATTGATCTTGGAGAGGAGTTTTCTGG AAG AGGCATTTTCCCACTGGCTGAAAG A 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCCACATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTAT.TTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACC AAG AATCAATGGACAT GAACAACCCTC AGGACT TT ATTGATTGCTT CCTGATGAAAATGGAGAAGG AAAAG CAC AAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGTTGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTeCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGIFPLAER 
AN RG FG I V F S NGKKWKE I RR FS LMT LRN FGMGKRS I E DR VQEEARC L VE E LRKT KAS PC D PT F I LG C A PCN V I CS 1 1 FHKR FD YK D 
QQFLNLMEKLNENIHILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEK^^ 

QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAVVHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDVGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 



1475 SEQ ID NO:87 & 88 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCGAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACCACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
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GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAMCC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIEILSSPWIQVYNNFPALLDYFPGTHHKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFOPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 



1477 SEQ ID NO:89 & 90 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTAGCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCGAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGAGCCAAAGAACGTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA> 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGISPLA^ 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQ FLN LMEKLN ENIEILSSPWIQVYNNF PAL L D Y F PGT HNKL LKN V A FMKS Y I LEK VKEHQE SM DMNN PQ D F I DC FLMKMEKE KHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
T S I LQN FN LKS LV D PKN L DTT P WNG FAS V P P FYQLC FI P VHHHH 



1491 SEQ ID NO; 91 & 92 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCGAGATTGCGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAG AACACC AAG AAT C AATGGAC ATG AAC AACC CTC AGGACTTT ATTG AT TG CTTC CTGATG AAAATGG AG AAGG AAAAGC AC AAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTT T AC AG AACT TT AACCT G AAATCTCTGGTTG ACCC AAAG AACCTT G AC ACC ACTCC AGT T GTC AATGG ATTTGC CTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKAS PCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNIJ4EKLNENIEIASSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYIIJ:KVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAVVHEVQRYIDLLP 
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TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFI PVHHHH 



1595 SEQ ID NO: 93 & 94 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCCATTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCGAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACCCATAACAAATTACTTAAAAACCTTGCTTTTATGGAAAGTGATATTTTGGAGAAAGTAA 
AAGAACACCAAGAATCGATGGACATCAACAACCCTCGGGACTTTATTGATTGCTTCCTGATCAAAATGGAGAAGGAAAAGCAAAAC 
CAACAGTCTGAATTCACTATTGAAAACTTGGTAATCACTGCAGCTGACTTACTTGGAGCTGGGACAGAGACAACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTCGTTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGHFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIEILSSPWIQVYNNFPALLDYFPGTHNKLLKNLAFMESDILEKVKEHQESMDINNPRDFIDCFLIKMEKEKQN 
QQSEFTIENLVITAADLLGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERWGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGECRICVGEALA'GMELFLFL 
TSILQNFNLKSLVDPKNLDTTPVVNGFASVPPFYQLCFIPVHHHH - , v :; 




1600 SEQ ID NO: 95 & 96 ..^jA - 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAlGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCCATTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCGAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAG AAC ACCAAG AAT CAATGG ACATGAAC AACCCTC AGG ACTTT ATTGATTGCTT CCTG AT G AAAAT GGAG AAGGAAAAGCACAAC 
CAACC ATCTG AATTC ACT ATTG AAAACT TGG TAATC ACTGC AGCTGACT T ACTTGG AGCT GGGACAGAGAC AACAAGC ACAACC CT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTCGTTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTATCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGHFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIEILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIENLVITAADLLGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERWGRNRSPCMQDRSHMPYTDAWHEVQRYIDLIP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFI PVHHHH 



1610 SEQ ID NO: 97 & 98 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
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CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCGAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTCACTATTGAAAACTTGGTAATCACTGCAGCTGACTTACTTGGAGCTGGGACAGAGACAACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

^4AKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSI^TLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIEILSSPWIQVYNNFPALLDYFPGTHNKLLKWAFMKSYILEKVKEHQE 

QPSEFTIENLVITAADLLGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
T S I LQN FN LKS L V D PKN LDT T P WN G FAS V P P FYQLC FI P VH HHH 



1632 SEQ ID NO: 99 & 100 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCCATTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCAGTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCGAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTtTCCGGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAAXTCACTATTGAAAACTTGGTAATCACTGCAGCTGACTTACTTGGAGCTGGGACAGAGACAACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCGCGATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATAGTTCATGCCTOTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVrTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGHFPLAER 
AN RG FG I V F SNGKKW KE I RR FS LMT LRN FGMGKRS I E DRVQE E ARCL VEE LRKTKAS PC D PT FI LGC APCNVI C S 1 1 FH KR F D YK D 
QQFLNLMEKLNENIEILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFI DCFLMKMEKEKHN 
Q P S E FT I EN L V IT AADLLGAGT E TT STT LRY ALL LLLKH PE VT AKVQEE I E R V IG RNR S PCMQ DR S HM P YT D AWHE VQR Y I D LL P 
T S L PHAVTC DIKFRNYLI PKGTT ILISLTSVLHDNKEFPNPEMFDPHHFL OEGGN FKKS KY FMP FS AGKR I CVGEALAGME L FL FL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 



1661 SEQ ID NO: 101 & 102 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCCATTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCGAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAACTTGGAAATCACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 
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MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGHFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIEILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIENLEITAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 



1662 SEQ ID NO: 103 & 104 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCGAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAACTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSK\^GPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNIi4EKLNENIEILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIENLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 

1664 SEQ ID NO: 105 & 106 Vv ■Jv^!? 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCGAGATTTTGAGCAGCCCCTGGATCCAGGTCTACAATAATTTCCC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAACTTGGAAATCACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
G AGAT AT GCTCT CCTTCT CCTGCTGAAG C ACCC AGAGGTCAC AGCT AAAGTCCAGG AAG AGATTG AACGTGTG ATTGGC AGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIEILSSPWIQVYNNFPALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIENLEITAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 

1039 SEQ ID NO:107 & 108 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTcTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA • 
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GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGATCTGCAATAATTTTTC 
TGCTATCATTGATTACTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCcTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNU4EKLNENIKILSSPWIQICNNFSAIIDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDt4NNPQDFIDCFIi4KMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 



1365 SEQ ID NO:109 & 110 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGGATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATGCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCeTTGTGGAGGAGTTCAGAAAAACCAAGGCCfCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTAT-TTTGGATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGGAGCCCCTGGATCCAGGTCTACAATAATTTCTC 
TGCTCTCCTTGATTATTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGGTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGGACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTGGCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKAS PCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNLMEKLNENIKILSSPWIQVYNNFSALLDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFI DCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 



1423 SEQ ID NO: 111 & 112 

ATGGCTAAGAAAACGAGCTCTAAAGGGCGGCCGCCTGGCCCCACTCCTCTCCCAGTGATTGGAAATATCCTACAGATAGGTATTAA 
GGACATCAGCAAATCCTTAACCAATCTCTCAAAGGTCTATGGCCCTGTGTTCACTCTGTATTTTGGCCTGAAACCCATAGTGGTGC 
TGCATGGATATGAAGCAGTGAAGGAAGCCCTGATTGATCTTGGAGAGGAGTTTTCTGGAAGAGGCATTTTCCCACTGGCTGAAAGA 
GCTAACAGAGGATTTGGAATTGTTTTCAGCAATGGAAAGAAATGGAAGGAGATCCGGCGTTTCTCCCTCATGACGCTGCGGAATTT 
TGGGATGGGGAAGAGGAGCATTGAGGACCGTGTTCAAGAGGAAGCCCGCTGCCTTGTGGAGGAGTTGAGAAAAACCAAGGCCTCAC 
CCTGTGATCCCACTTTCATCCTGGGCTGTGCTCCCTGCAATGTGATCTGCTCCATTATTTTCCATAAACGTTTTGATTATAAAGAT 
CAGCAATTTCTTAACTTAATGGAAAAGTTGAATGAAAACATCAAGATTTTGAGCAGCCCCTGGATCCAGATCTGCAATAATCCTTC 
TGCTATCATTGATTACTTCCCGGGAACTCACAACAAATTACTTAAAAACGTTGCTTTTATGAAAAGTTATATTTTGGAAAAAGTAA 
AAGAACACCAAGAATCAATGGACATGAACAACCCTCAGGACTTTATTGATTGCTTCCTGATGAAAATGGAGAAGGAAAAGCACAAC 
CAACCATCTGAATTTACTATTGAAAGCTTGGAAAACACTGCAGTTGACTTGTTTGGAGCTGGGACAGAGACGACAAGCACAACCCT 
GAGATATGCTCTCCTTCTCCTGCTGAAGCACCCAGAGGTCACAGCTAAAGTCCAGGAAGAGATTGAACGTGTGATTGGCAGAAACC 
GGAGCCCCTGCATGCAAGACAGGAGCCACATGCCCTACACAGATGCTGTGGTGCACGAGGTCCAGAGATACATTGACCTTCTCCCC 
ACCAGCCTGCCCCATGCAGTGACCTGTGACATTAAATTCAGAAACTATCTCATTCCCAAGGGCACAACCATATTAATTTCCCTGAC 
TTCTGTGCTACATGACAACAAAGAATTTCCCAACCCAGAGATGTTTGACCCTCATCACTTTCTGGATGAAGGTGGCAATTTTAAGA 
AAAGTAAATACTTCATGCCTTTCTCAGCAGGAAAACGGATTTGTGTGGGAGAAGCCCTGGCCGGCATGGAGCTGTTTTTATTCCTG 
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ACCTCCATTTTACAGAACTTTAACCTGAAATCTCTGGTTGACCCAAAGAACCTTGACACCACTCCAGTTGTCAATGGATTTGCCTC 
TGTGCCGCCCTTCTACCAGCTCTGCTTCATTCCTGTCCACCACCACCACTGA 

MAKKTSSKGRPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIWLHGYEAVKEALIDLGEEFSGRGIFPLAER 
ANRGFGIVFSNGKKWKEIRRFSI^TLRNEXSMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKD 
QQFLNU1EKLNENIKILSSPWIQICNNPSAIIDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN 
QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAWHEVQRYIDLLP 
TSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFL 
TSILQNFNLKSLVDPKNLDTTPWNGFASVPPFYQLCFIPVHHHH 





